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TITLE: Method for detecting transient ligand interactions 

FIELD OF THE INVENTION 

The invention relates to methods for separating and/or analyzing 
5 ligands in an affinity matrix system. In particular, the invention relates to 
separating substoichiometrically interacting ligands from high abundance 
ligand of interest in an affinity matrix system. 

BACKGROUND OF THE INVENTION 

An important scientific pursuit in the post-genomic era is to determine 

10 the components of multiprotein complexes and to identify the protein-protein 
interactions In a proteome. The proteome consists of stable protein 
complexes and single (free) proteins but virtually every cellular process is 
mediated by and/or consists in transient protein interactions, e.g. protein- 
protein interactions, DNA-protein interactions or RNA-protein interactions. Tlie 

15 detection and analysis of the transient interactions is a major challenge in 
bi ology and proteomics . 

One method for protein purification and detection of protein interactions 
is creating fusion proteins by using recombinant DNA techniques. For 
example, the fusion protein can contain a protein of interest and an affinity tag 

20 capable of binding selectively to an affinity matrix. After expressing the fusion 
protein in exogenous or endogenous organism, it is immobilized on an affinity 
matrix and the unbound substances are removed. Subsequently, the fusion 
protein is released from the affinity matrix. The method is used in order to 
purity the affinity tagged protein or, more recently, the affinity tagged protein 

25 and the proteins that associate with it. The advent of sensitive mass 
spectrometric methods for protein identification and improved affinity 
purification methods made possible the direct identification of protein 
complexes on a proteome-wide scale (Ho Y. et al Nature 2002, Gavin, A.C. et 
al. Nature 2002). A review by R. Aebersold and M. Mann. Nature, March 

30 2003, Vol. 422, "Mass spectrometry-based proteomics" describes in more 
detail the state of the field. 
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More than 300 stable complexes have been identified in the model 
organism Saccharomyces cerevisiae and extensive proteome maps have 
been built. However, it became clear that the number of transient protein- 
protein interactions detected by affinity purification coupled with mass 
5 spectrometry is smaller (often by an order of magnitude) compared to the 
number of transient protein-protein interactions detected by genetic based 
methods and library based methods. This is due to the fact that when the 
fusion protein forms transient complexes with other proteins, the latter are 
isolated in substoichiometric amounts and most often are not detected. 

10 When the fusion protein and the interacting proteins are isolated via an 

affinity tag, the bulk of the fusion protein is immobilized on the affinity matrix, 
whereas only that fraction of the interacting proteins which has been bound to 
the fusion protein in vivo is immobilized. When the fusion protein is released 
from the affinity matrix, the ratio between the proteins in the eiuate remains 

15 the same and, as a result, the highly abundant fusion protein suppresses the 
identification and analysis of the substoichiometrically interacting proteins. 
The detection of transient protein-protein interactions (e.g. Interactions 
between stable protein complexes or between two single proteins or between 
a stable protein complex and a single protein) is still very difficult and In most 

20 cases impossible. Examples of proteins involved in transient protein-protein 
interactions include protein substrates, modifying enzymes such as protein 
kinases, signal transducers, DNA polymerase, replication factors, RNA 
polymerase, and transcription factors. 

The following example with RNA polymerase II and the transcription 

25 factors illustrates the difficulty of detecting transient protein-protein 
interactions and the general problem of detecting any transient protein 
interaction by using any affinity tagged protein from any organism. 

Eukaryotic RNA polymerase II is a permanent complex of twelve 
subunits and it forms transient complexes with different transcription factors. 

30 When a SpA-tagged subunit of RNA Polymerase II core complex from 
Sacctiaromyces cerevisiae is expressed under physiological conditions, it 
associates pennanently with other subunits of the core complex and, as a part 
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of the core complex, it interacts directly or indirectly with a variety of 
transcription factors by forming transient (weak) complexes. After preparation 
of a protein lysate and Immobilization of the SpA-tagged subunit on IgG- 
beads, the other subunlts of the core complex are present in approximately 
5 equal molar amounts but the transcription factors are present in 
substoichiometric amounts. 

Several factors contribute to the unequal stoichiometries including: (a) 
during the preparation of the cellular lysate and subsequent purification steps, 
transient complexes containing the RNA Polymerase II core complex 

10 dissociate In different ways and each core complex remains associated with a 
different set of transcription factors, (b) during the synthesis of mRNA, the 
RNA Polymerase II core complex associates with different proteins at different 
stages of the transcription, e.g. initiating factors, mediator complex, elongation 
factors, temiination factors, (c) during the transcription of different genes, 

15 RNA Polymerase II core complex associates with different sets of gene 
specific transcription factors, and (d) a certain amount of the RNA Polymerase 
II core enzyme is not Involved In transcription and is not associated with other 
proteins. 

As a result, the amounts of the subunlts of the core complex on the 
20 affinity matrix are disproportionably higher than the amounts of the interacting 
proteins. The isolation of the complexes by releasing the SpA-tagged subunit 
from the IgG-sepharose matrix and their analysis by mass spectrometry leads 
to Identification of the subunlts of the core complex, subunlts of the general 
transcription factor TFHF and, in some cases, Spt5, TaflO, Cegl, Essi, Kln28 
25 and Srb6. However, more than 60 proteins that interact physically with the 
RNA polymerase II core enzyme are reported in the literature (Myer V.E. and 
Young R.A., J.Biol.Chem 1998). 

Even when the isolated RNA Polymerase II complexes are fractionated 
by electrophoresis, some of the twelve subunlts of the core complex (most 
30 often Rpbl and Rpb2) and their different modification forms and degradation 
products are present In large amounts in many bands and they suppress the 
identification and/or analysis of low abundance proteins by mass 
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spectrometry. In this case, the mass spectrum contains very large peaks, 
resulting from the high abundance proteins (i.e. subunits of Polymerase II 
core complex), and small peaks, resulting from the low abundance Interacting 
proteins (i.e. transcription factors). Most often, the small peaks are not even 

■ 

5 detected and the analysis of the spectrum leads to identification of only the 
major protein(s). This is due to the following problems in mass spectrometry: 

(i) The resolution problem In mass spectrometry instmments. If a 
large peak and small peak are separated by only a few mass units 
(i.e. their molecular weight differs by only 3-4 Da), the detection of 

10 the latter is impossible. 

(ii) The dynamic range problem in mass spectrometry (high 
background noise problem). The signals from the abundant 
peptides suppress the signal from the minor proteins, even if they 
are separated by more than 3-4 mu. The background of the mass 

"•5 spectrum is due to the presence of lai^e numbers of ions with low 

intensity. They are mostly multiple-charge Ions and fragment ions 
from the major peptides and they are not detected as signals but as 
a background noise in the mass spectrum. In addition, adducts that 
are bound to the abundant peptides increase the background 

20 (chemical noise). 

(iii) Ionization of the MALDI-TOF samples (co-crystallized peptides and 
a UV-matrix) is most optimal when the molar ratio between the 
matrix and the peptides is between 2,000:1 and 20,000:1. Hence, it 
is impossible to achieve a good ionization, and consequentially a 

25 good spectrum, for both high abundance and low abundance 

peptides, when the ratio between their amounts is 100:1 or 1000:1 
or higher. 

The example demonstrates the problems that are encountered when 
transient interactions are investigated by affinity purification coupled with 
30 mass spectrometry. The problem is even more severe in gel-less protein 
identification approaches (e.g. LS/MS/MS) because of the absence of initial 
separation step (PAGE) of the isolated proteins. 
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Besides the dynamic range problem in mass spectrometry, otfier 
problems arise when low abundance proteins or other substances are 
analyzed together with high abundance ones: (a) cross-reactivity problem in 
Immunoassays, (b) overloaded chromatography columns or overloaded 
5 protein gels, and as a result, poor resolution, (c) ambiguous results in 
enzymatic assays. 

The present invention solves these problems in an affinity matrix 
system by separating the substoichiometrically interacting proteins and/or 
other biomolecules from the high abundance protein of interest. 

10 SUMMARY OF THE INVENTION 

The present invention relates to methods of separating, and optionally 
analyzing, ligands that form transient complexes with other ligands in vivo. 

In one example, the invention relates to methods of separating a first 
ligand from a second ligand in an affinity matrix system, wherein the first 
15 ligand associates in vivo with the second ligand and wherein the first ligand 
and the second ligand associate with each other by electrostatic forces, 
comprising the steps: 

(a) obtaining a sample containing biological complexes that include the 
first ligand and the second ligand; 
20 (b) immobilizing the second ligand on an affinity matrix; 

(c) removing unbound substances from the affinity matrix; 

(d) separating the first ligand from the immobilized second ligand, 
which remains bound to the affinity matrix during the separation, by 
decreasing the electrostatic forces between the first ligand and the 

25 second ligand; and 

(e) optionally, analyzing the separated first ligand. 

In another example, the invention relates to methods of separating a 
first ligand flrom a second ligand in an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand, wherein the firet ligand and 
30 the second ligand associate with each other by electrostatic forces, and 

wherein the second ligand is a recombinant fusion protein containing at least 
one affinity tag, comprising the steps: 
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(a) introducing into a cell or organism a recombinant nucleic acid 
molecule encoding a fusion protein comprising the second ligand 
fused to at least one affinity tag that can selectively bind to an 
affinity matrix; 
5 (b) expressing the fusion protein; 

(c) obtaining a sample containing biological complexes that include the 
first ligand and the fusion protein; 

(d) immobilizing the fusion protein on the affinity matrix via the affinity 
tag; 

10 (e) removing unbound substances firom the affinity matrix; 

(f) separating the first ligand from the Immobilized fusion protein, which 
remains bound to the affinity matrix during the separation, by 
decreasing the electrostatic forces between the first ligand and the 
fusion protein; and 
1 5 (g) optionally, analyzing the separated first ligand . 

In a further example, the invention relates to methods of separating a 
first ligand firom a second ligand in an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand, wherein the first ligand and 
the second ligand associate with each other by electrostatic forces, and 
20 wherein the second ligand is a recombinant fusion protein containing at least 
two different affinity tags, comprising the steps: 

(a) introducing into a cell or organism a recombinant nucleic acid 
molecule encoding a fusion protein comprising the second ligand 
fused to at least two different affinity tags that can selectively bind 

25 to different affinity matrixes; 

(b) expressing the fusion protein; 

(c) obtaining a sample containing biological complexes that include the 
first ligand and the fusion protein; 

(d) immobilizing the fusion protein on a first affinity matrix via a first 
30 affinity tag; 

(e) removing unbound substances firom the first affinity matrix; 

(f) separating the fijsion protein from^ the first affinity matrix; 
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(g) immobilizing the fusion protein on a second affinity matrix via a 
second affinity tag, which is different than the first affinity tag; 

(h) removing unbound substances from the second affinity matrix; 

(i) separating the first ligand from the immobilized fusion protein, which 
5 remains bound to the second affinity matrix during the separation, 

by decreasing the electrostatic forces between the first ligand and 
the fusion protein; and 
0) optionally, analyzing the separated first ligand. 
In another example, the Invention relates to methods of separating a 
10 first ligand from a second ligand in an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand, wherein the first ligand and 
the second ligand associate with each other by electrostatic forces, and 
wherein the second ligand is a protein complex containing two or more 
subunits of which are fused to different affinity tags, comprising the steps: 
15 (a) introducing Into a cell or organism recombinant nucleic acids 

molecules encoding fusion proteins comprising the two or more 
subunits of which are fused to different affinity tags that can 
selectively bind to different affinity matrixes; 
(b) expressing the fusion proteins; 
20 (c) obtaining a sample containing biological complexes that include the 

first ligand and the fusion proteins; 

(d) immobilizing the fusion proteins on a first affinity matrix via a first 
affinity tag; 

(e) removing unbound substances from the first affinity matrix; 
25 (f) separating the fusion proteins from the first affinity matrix; 

(g) Immobilizing the fusion proteins on a second affinity matrix via 
second affinity tag, which is different than the first affinity tag; 

(h) removing unbound substances firom the second affinity matrix; 

(i) separating the first ligand from the Immobilized fusion proteins, 
30 which remain bound to the second affinity matrix during the 

separation, by decreasing the electrostatic forces between the first 
ligand and the fusion proteins; and 
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(j) optionally, analyzing the separated first ligand. 

The method of the invention is suitable for detecting transient protein- 
protein interactions, and other transient interactions that involve proteins, 
including but not limited to DNA-protein interactions, RNA-protein interactions, 
carbohydrate-protein interactions, and lipid-protein interactions. 

Another aspect of the invention is a method for drug discovery. A 
chemical or biomolecule can be identified as a drug or a pre-drug by its ability 
to selectively affect a particular protein interaction that is associated with a 
disease. 

Other features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, 
however, that the detailed description and the specific examples while 
indicating preferred embodiments of the invention are given by way of 
illustration only, since various changes and modifications within the spirit and 
scope of the invention will become apparent to those skilled in the art from 
this detailed description. 

BRIEF DES CRIPTION OF THE DRAWIMftfi 

The Invention will now be described in relation to the drawings In 

which: 

Figure 1 shows exploded views of a chromatography column packed 
with ligand coated solid support: 1 - chromatography column packed with 
affinity beads, 2 - an expanded view of several individual beads. 3 - an 
expanded view of an individual bead of affinity matrix, 4 - solid support (e.g. 
agarose or sepharose), 5 - ligand (e.g. IgG or glutathione) that binds 
selectively to affinity tag. 6 - linker between the solid support and the ligand. 

Figure 2 illustrates the two essential steps (bold vertical arrows) of the 
invention Affinity Purification with Switching the Ligand (APSL). Legend: 7 - 
linker between the protein of interest and the affinity tag, 8 - affinity tag tiiat is 
fused to a protein of interest; 9 - protein of Interest or subunit of a pennanent 
protein complex that is affinity tagged (i.e. fijslon protein); 10. 11, 12, 13. 14. 
15 - proteins that interact directly with the fusion protein or protein complex;' 
19 - protein that Interacts indirectly with the fusion protein or protein complex,' 
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1 6 - the fusion protein or a protein complex and the interacting proteins are 
immobilized on the affinity matrix; 17 - the fusion protein or protein complex 
remains immobilized on the affinity matrix after the elution; 18 - the interacting 
proteins are separated into the liquid phase after the elution (the eluate is 
5 indicated with a dotted line); 20 - Step One: preparation of cellular lysate or 
other biological fluid from an organism or cell line containing a heterologous 
nucleic acid expressing an affinity tagged protein and immobilizing the affinity 
tagged protein on the affinity matrix by the tag; removing the proteins and 
other substances that are not bound directly or Indirectly to the fusion protein; 

10 21 - Step Two: separating the interacting proteins from the immobilized fusion 
protein by elution with an agent that does not disrupt the binding between the 
affinity matrix and the immobilized fusion protein or subunit of a permanent 
protein complex; only proteins and other substances that bind directly or 
indirectly to the immobilized protein are eluted. The bonds between some of 

15 the affinity tags and their respective ligands are mainly hydrophobic and it is 
therefore reasonable to Increase the Ionic strength of the medium in Step Two 
in order to weaken the electrostatic attractions by Debye-Huckel screening. 

Figure 3 Is a gel Image of high salt eluate from immobilized RNA 
Polymerase II from Saccharomyces cerevisiae. The fusion protein contains 

20 Rpb1 and a SpA tag containing four IgG binding domains. The digits and 
arrows at the right side of the gel indicate the identified protein. Legend: 31 - 
Spt6. 32 - Rgri, 33 - Spt6 + Spt5. 34- Tfgl + Spt5. 35- Set3. 36 - Cetl + 
Medl, 37 - Pob3, 38 - Npl3', 39 - Tfig2 + Iwsl, 40 - Cdc73. 41 - TFIIS. 42 - 



30 



25 Figure 4 is a gel image of high salt eluate from immobilized RNA 

Polymerase II from Saccharomyces cerevisiae. The fusion protein contains 
Rpb1 and a SpA tag containing two IgG binding domains. Legend: 51 - Spt5 
+ Spt6, 52 - Spt5 + Spte, 53 - Spt5 + Spt6, 54 - Ctr9 + Spt5 + Spt6, 55 - Spt5 
+ Ydl145 (Copi) + Spte. 56 - Tfgl + Spt5 + Spt6 + Spt16. 57 - Spt5 + Tfgl + 
Sec21, 58 - Spt5 + Tfgl, 59 - Set2 + Spt5 + Rtfl + Ydl145 (Copi), 60 - Cetl, 
61 - Tfg2 + Iwsl, 62 - Cegl. 63 - Cdc73. 64 - TFIIS, 65 - Rtt103, 66 - Tfg3, 67 
-Yhr121w, 68-Ess1. 
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Figure 5 is a gel Image which Identifies proteins that bind 
nonspeciflcally to IgG beads and/or to other proteins (contaminants). Legend: 
70 - Acci + Fasi + Fas2. 71 - Lys2 + Ydr098, 72 - IVIdnl + Tra1 . 73 - Sec27 
+ Ade3. 74 - Prti + Lys2. 75 - Sse1 + Lsg1. 76 - Ssb1 + Ssa2 + Pabi + 
6 Dedl . 77 - Hsp60 + Dhhl . 78 - Gcd1 1 + Dedl + Nop12 + Nog2, 79 - Tef2 + 
Phrl, 80 - Tef2 + L3. 81 - Nop3 + Tef2, 82 - Yor060 + Ydji. 83 - Stm1 + 
Rpl4, 84 - Tif34, 85 - RppO + Rpl5. 86 - Tef2 + Nopi . 87 - Rpsi + Rpl2. 88 - 
Rps3 + Rpl8. 89 - Rps2. 90 - Rps5. 91 - Rps8 + Rpl9. 92 - RpllO, 93 - 
Rps20, 94 - Rpl20, 95 - Rps14. 96 - Rps17. 97 - Rpl27 + Rps24. 
10 Figure 6 Is a gel image of high salt eluates from immobilized RNA 

Polymerase II complexes from Saccharomyces cerevisiae expressing a Rpbl- 
Sp-A tagged fusion protein. Legend: 120 - Spt6, 121 - Spt6, 122 - Spt5 + 
Cyri + CIul, 123 - Tfgl + Nam7 + Spt16. 124 - Fcp1 + Cdc48 + Sec21, 125 

- Taf5 + Top2, 126 - Cetl + Pcfl + Prp6, 127 - Pob3, 128 - Rtt103 + Rc3 + 
16 Hasl. 129 - Tfg2 + Iwsl + Papl. 130 - Tfg2 + Iwrl; 131 - Rail. 132 - Sua7. 

133 - TFIIS, 134 - Ypl253, 135 - Ssn3 + Mrt4. 136 - Tfg3 + Ptcl. 137 - 
Med7. 138 - Yhr121w. 139 - Mbfl + Taf9 + Yor379c, 140 - Ybr262c + 
Ybr174c. 

Figure 7 is a gel image of high salt eluate from immobilized RNA 
20 Polymerase 11 complexes from Saccharomyces cemvisiae expressing a Rpb3- 

Sp-A tagged fusion protein. Legend: 150 - Spt6, 151 - Spt5 + Ctr9 + New1. 

152 - Spt16 + Tor2, 153 - Tfgl + Hpr5 + Nam7, 154 - Fcp1 + Cdc48 + 

Nam7. 155 - Fcpl + Rad3, 156 - Fcpl + Taf5 + Pbpl, 157 - Cetl + Cyr1. 

158 - Pob3. 159 - Rtt103 + Ptc3. 160 - TFIIH + Rad23 + Drsi, 161 - Tfg2 i 
25 Iwsl. 162 - Rail + Pob3 + Rgri + Ygr086c, 163 - Fun11 + Subl. 164 - 

TFIIB, 165 - TFIIS + Yol045, 166 - Tfg3. 167 - Dln7, 168 - Yhr121w. 169 - 

Nhp2, 170 -Mbfl. 

Figure 8 is a gel image of high salt eluate from immobilized RNA 
Polymerase II complexes fi-om Saccharomyces cerevisiae expressing a Rpb9- 
30 Sp-A tagged fusion protein. Legend: 180 - Spt6 + Stt4 + Ydl145 (Copi). 181 

- Spt5 + New1 + Tofi, 182 - Spt5 + Rad3. 183 - Tfgl + Cdc48 + Rgri + 
Nam7, 184 - Fcpl + Cdc48 + Radi + Sec21, 185 - Fcpl + Sup35 + Rap1. 
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186 - Cetl + Rapl + Cdc6. 187 - Tfg2 + Iwsl, 188 - Cegl + Kini + Prp8. 
189 - Cegl + Rgml. 190 - Ygr086c + Ydr065w, 191 - TFIIB. 192 - TFIIS + 
Thol + Bimi , 193 - Tfg3 + Tfa2 (TFIIE), 194 -Set3 + Tfb2 (TFIIH) + Yol022c. 
Cdc6. Prp2, Prp16, Rifl and Cfl1 were also detected. 
5 Figure 9 is a gel image of high salt eluate from immobilized TFIIF from 

Saccharomyces cerevisiae. The carboxy-terminus of the chromosomal copy 
of the gene encoding Tgfl is fused in-frame with a TAP-tag. Legend: 201 - 
Rpb1, 202 - Rpb1+Rpb2. 203 - Fcpl + Rpb2 + Rpbl. 204 - Rpbl + Rpb2 + 
Fcpl . 205 - Glo3, 206 - Rpb3, 207 - TFIIS. 208 - Mpd2. 209 - Rpb4. 210 - 

10 Yhr121w + Rpb5. 21 1 - Rpb6. 212 - Rpb7, 213 - Mbfl + Rpb8. 214 - Rpb9. 

Figure 10 Is a gel Image of high salt eluate from immobilized RNA 
Polymerase II from Saccharomyces cerevisiae. The carboxy-terminus of the 
chromosomal copy of the gene encoding Rpbl is fused in-frame with a TAP- 
tag. Legend: 221 - Rpbl. 222 - Spt6. 223 - Spt6. 224 - Rpb2 + Ctr9. 225 - 

15 Ctr9 + Spt6. 226 - Tfgl + Prp22 + Spt16 + Spt6, 227 - Tfgl + Fcpl + Spt16 + 
Nipl . 228 - Fcpl + Set2 + Rtf1 + Ygl244, 229 - Tfg1 + Fcpl + Ygr054 + Leol 
+ Med1, 230 - Pob3, 231 - Pafi + Glo3, 232 - Iwsl + Tfg2, 233 -TFg2, 234 - 
Sua7 (TFIIB homolog). 235 - TFIIS, 236 - Mpd2 . 237 - Tfg3. 238 - Essi, 239 
- Mbfl . 

20 Figure 1 1 is a gel image of high salt eluate from Immobilized RNA 

Polymerase II from Saccharomyces cerevisiae (right lane) and the 
subsequentially released core complex (left lane). The carboxy-terminus of 
the chromosomal copy of the gene encoding Rpbl is fused In-frame with a 
TAP-tag. Legend: 271 - Spt6, 272 - Ydl145 (Copi). 273 - Ctr9 + Spt5, 274 - 

25 Tfg1 + Spt16 + Ltel, 275 - Tfgl + Spt5 + Spt16 + Fcpl, 276 - Set2, 277 - 
Fcpl + Leol, 278 - Pob3. 279 - Paf1, 280 ~ Pafi + Rtt103, 281 - Tfg2 + 
Iwsl. 282 - Tfg2, 283 - Cdc73 + Cegl, 284 - Dstl. 285 - Rtt103, 286 - T1g3, 
287 - Essi. 288 - Rpbl, 289 - Rpb2, 290 - Tfgl, 291 - Tfg2. 292 - Rpb3, 
293 - Tfg3. 294 - Rpb4, 295 - Rpb5, 296 - Rpb6. 297 - Rpb7, 298 - Rpb8. 

30 Figure 12 is a gel Image of the high salt eluate from immobilized Rpb3 

from Saccharomyces cerevisiae. The carboxy-terminus of the chromosome 
copy of the gene encoding Rpb3 is fused in-frame with a TAP-tag. The 
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Interactlng proteins are indicated. Trace amounts of Rpbl , Rpb2 and Rpb3 
are also indicated. 

Figure 13 sliows a comparison between the method of the invention 
and other methods for detecting protein-protein interactions that utilize affinity 
5 tagged proteins. Legend: 301 - the left panel illustrates GST pull down, 302 - 
the middle panel illustrates purification via an affinity tag, 303 - the right panel 
illustrates the method of the invention. 310 - exogenous organism (e.g. 
Escherichia coli). 311 - endogenous organism. 312 - affinity tagged protein. 
313 - the affinity tagged protein and the interacting proteins are immobilized 

10 on solid phase, 314 - unbound proteins and/or other substances (solid black) 
are removed, 315 - isolating the interacting proteins - only in the case of the 
invention they are separated from the affinity tagged protein (the eluate does 
not contain affinity tagged protein). 316 - eluate obtained by GST pull down; 
317 - eluate obtained by purification via an affinity tag; 18 - eluate obtained by 

15 the method of the invention. 

Figure 14 illustrates the necessity to separate substoichiometrically 
interacting proteins from the high abundance fusion protein. The bull< of the 
fusion protein is Immobilized via an affinity tag whereas only the fraction of 
interacting proteins that has been bound to the fusion protein in vivo is 

20 immobilized. 330 - transient complexes are immobilized on an affinity matrix 
via an affinity tag, 331 - purification via an affinity tag, 332 - purification by 
the method of the invention, 317 - eluate obtained by purification via an 
affinity tag, 18 - eluate obtained by the method of the invention. 

Figure 15 Illustrates the formation and disintegration of a transient 

25 protein complex between two stable (pemianent) protein complexes (first and 
second vertical arrows respectively). Legend: 351 - stable protein complex 
with 4 subunits, 352 - stable protein complex with 6 subunits, 353 - transient 
protein complex between the two stable complexes, 354 - subunit of a stable 
complex that binds indirectly to another stable complex, 355 - formation of a 

30 transient protein complex, 356 - disintegration of the complex. 

Figure 16 is a model for transient protein-protein interactions according 
to which fomiation and disintegration of transient protein complexes is due to 
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alternation of electrostatic attraction and electrostatic repulsion between the 
electrostatically charged amino acids. Legend: 370 - first interacting protein. 
371 - second interacting protein. 372 - positively charged amino acids. 373 - 
negatively charged amino acids. 374 - electrostatic attraction. 375 - two 
interacting proteins fomi a complex. 376 - change of position or orientation of 
the two interacting proteins relative to one another. 377 - electrostatic 
repulsion, 378 - transient complex disintegrates. 

Figure 17 illustrates that when a mutation changes the electrostatic 
properties of the protein, the transient interaction cannot occur properly 
Legend for the upper panel: 380 - mutation leading to disappearance of 
positively charged amino acid. 381 - electrostatic attraction is prevented. 
Legend for the lower panel: 382 - mutation leading to disappearance of 
different positively charged amino acid. 383 - electrostatic repulsion is 
prevented and the interacting proteins remain stuck together. 

Figure 18 shows a method for drug design and schematic structure of a 
chemical or a biomolecule that binds to a mutant protein and restores (at least 
partially) its electrostatic properties. As a result, the protein is capable of 
forming a protein complex. Legend: 390 - drug treatment, 391 - a drug. 392 - 
an exploded view of the drug. 393 - part that mimics and/or restores the 
original charge, 394 - linker (optional). 395 - part that binds selectively to the 
mutant protein. 

Figure 19 illustrates the selective effect of a drug on a particular 
protein-protein interaction. Legend: 430- left panel: the immobilized protein of 
interest is treated with nonspecific elutlon agent and all interacting proteins 
are separated. 431 - middle panel: the immobilized protein of interest is 
treated with specific elutlon agent which causes separation of only one 
interacting protein complex. 432 - right panel: the unrelated protein complex 
Is treated with the same specific elutlon agent but there is no separation of the 
interacting proteins. 438 - unrelated protein. 439 - protein interacting with the 
unrelated protein. 449 - another protein interacting with the unrelated protein 
440 - elutlon with nonselective elutlon agent (e.g. increased salt 
concentration). 441 - elution with selective agent (e.g. drug treatment) 18 - 
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eluate obtained by the method of the invention, i.e. proteins eluted by 
treatment with a nonselective elutlon agent, 445 - proteins eluted by 
treatment with a selective agent; 446 - the unrelated protein complex is not 
affected by the treatment with a selective agent and the interacting proteins 
5 are not eluted. 

Figure 20 illustrates elution by a gradient and collecting different 
fractions and analyzing each one (by immunodetection for example) for the 
presence of the affinity tag or the affinity tagged protein: 401 - 
chromatography column; 402 - nine subsequent fractions, small amount of 
10 each one are blotted to a membrane and analyzed; 41 1. 412. 413, 414, 415, 
416 - blots conresponding to fractions that do not contain affinity tagged 
protein. 417, 418. 419 - blots corresponding to fractions that contain 
• Increasing amounts of the affinity tagged protein. 

Figure 21 illustrates the problems with analyzing fractions or gel bands 
containing proteins with different abundances by mass spectrometry. After the 
trypsin digest, the concentration range of the peptides remains approximately 
the same and It results in mass spectrum containing peaks with different 
Intensity. Left panel: When the mass spectrum contains large and small peaks 
separated by only a few (1-5) mass units, the detection of the latter is 
20 impossible because of the dynamic range problem in mass spectrometry. 
Analyzing such spectrum leads to the identification of only the major protein(s) 
in the band. Right panel: High background noise problem in mass 
spectrometry - when a protein mixture with high dynamic range (ratio between 
the most abundant and the least abundant protein) is analyzed directly the 
25 small peptides are not detected because their intensities are lower than the 
background. Legend: 450 - high abundance peptide; 451 - different low 
abundance peptides; 452 - background of the mass spectrum; 453 - low 
abundance peptide is not detected because It is obscured by a high 
abundance peptide which mass is two mass units smaller; 454 - the 
30 separation of the substoichiometrically interacting proteins from the high 
abundance protein of interest results in lower background (due to the absence 
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of the high abundance peptides) and makes possible the detection of the low 
abundance peptides. 

Figure 22 shows the two subunlts of the RNA Polymerase II core 
complex (Rpbl and Rpb2) co-purified with Rtt103. Legend: 470 - Rpb1, 471 - 
5 Rpb2. 472 - Rati , 473 - Rtt1 03. 474 - Rai 1 . 

DETAILED DESCR IPTION OF THE INVENTinM 

The invention relates to methods for separating, and optionally 
analyzing, ligands in an affinity matrix system. In particular, the invention 
relates to separating, and optionally analyzing, substoichiometrically 
10 interacting ligands from the high abundance ligand of interest in an affinity 
matrix system. 

In one example, the invention relates to methods of separating a first 
ligand fi-om a second ligand in an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand and wherein the first ligand 
1 5 and the second ligand associate with each other by electrostatic forces, 
comprising the steps: 

(a) obtaining a sample containing biological complexes that include the 
first ligand and the second ligand; 

(b) immobilizing the second ligand on an affinity matrix; 
20 (c) removing unbound substances from the affinity matrix; 

(d) separating the first ligand from the immobilized second ligand, 
which remains bound to the affinity matrix during the separation, by 
decreasing the electrostatic forces between the first ligand and the 
second ligand; and 
25 (e) optionally, analyzing the separated first ligand. 

In another example, the invention relates to methods of separating a 
first ligand from a second ligand in an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand, wherein the first ligand and 
the second ligand associate with each other by electrostatic forces, and 
30 wherein the second ligand is a recombinant fusion protein containing at least 
one affinity tag, comprising the steps: 
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(a) introducing into a cell or organism a recombinant nucleic acid 
molecule encoding a fusion protein comprising the second ligand 
fused to at least one affinity tag that can selectively bind to an 
affinity matrix; 
5 (b) expressing the fusion protein; 

(c) obtaining a sample containing biological complexes that include the 
first ligand and the fusion protein; 

(d) immobilizing the fusion protein on the affinity matrix via the affinity 
tag; 

10 (e) removing unbound substances firom the affinity matrix; 

(0 separating the first ligand from the immobilized fusion protein, which 
remains bound to the affinity matrix during the separation, by 
decreasing the electrostatic forces between the first ligand and the 
fusion protein; and 
15 (g) optionally, analyzing the separated first ligand. 

One distinctive feature of the invention is that the second ligand 
remains immobilized on the affinity matrix (directly or indirectly), but the first 
ligand, which is the interacting molecule of interest, is separated into a liquid 
phase, which can be analyzed separately. 
20 Unless otherwise indicated, the terms "high abundance", "low 

abundance", "high amount" and "low amount" designate the amounts of the 
ligands when they are immobilized on the affinity matrix but not their amounts 
in the organism or cell. 

The definition of a "transient protein complex" as used herein is a 
25 complex with components that exist as both complexes and free proteins 
through electrostatic interactions. 

The temn "transient ligand complex" as used herein refers to a complex 
of components that exist as both complexes and free ligands through 
electrostatic interactions. 

30 The tenn "interacting ligand" as used herein refers to the first ligand, 

which is the ligand that forms a transient ligand complex with the ligand of 
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interest (second ligand) and it is not to be confused with the subunits of a 
pemrianent ligand complex. 

The term "ligand of Interesf as used herein refers to the second ligand 
and designates the ligand whose partners are sought. 
5 The tenn "associated ligand" as used herein refers to ligands that form 

either stable or transient complexes with the ligand of interest (second ligand). 

The term "ligand" as used herein refers to a molecule that can bind 
another molecule and includes, but is not limited to proteins, RNA, DMA, lipids 
and carbohydrates. 

10 The term "affinity matrix system" as used herein means a system 

where a particular ligand is immobilized or coupled to a solid support so that 
when a sample comes into contact with immobilized ligand, only those 
molecules in the sample having specific binding affinity to the ligand are 
bound to the immobilized ligand. An affinity matrix system makes use of 
16 specific binding interactions between molecules. 

The following temis, synonyms and abbreviations are used herein: 
protein interaction = interaction involving protein (e.g. protein-protein, 
DNA-protein, RNA-protein, carbohydrate-protein, lipid-protein); 

ligand interaction = interaction involving ligands (e.g. proteins, DNA, 
20 RNA, carbohydrates and/or lipids); 

protein of interest = protein whose partners are sought; 
ligand of interest = ligand whose partners are sought = second ligand; 
weak protein interaction = transient protein interaction; 
weak ligand interaction = transient ligand interaction; 
25 stable protein complex = pennanent protein complex; 

stable ligand complex = permanent ligand complex; 
high salt eluate = eluate that has been obtained by increasing the ionic 
strength of the medium surrounding the immobilized complexes; 

SpA-tag = affinity tag consisting of one or more IgG binding domains 
30 genetically derived from Protein A from Staphylococcus aureus; 

SpG-tag = affinity tag consisting of one or more IgG binding domains 
genetically derived flrom Streptococcal protein G (strain G148); 
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unbound substances = substances that are not immobilized directly or 
indirectly on the affinity matrix; 

protein complex = biological complex containing one or more protein 
components; 

5 ligand complex = biological complex containing one or more ligand 

components; 

washing buffer = the buffer that is used to remove the unbound 
substances; 

elution buffer = the buffer that is used to separate the interacting ligand 
10 (first ligand) from the immobilized ligand of interest (second ligand). In an 
example of the method of the invention, the protein of interest (second ligand) 
Is a fusion protein and the elution buffer is washing buffer with an increased 
concentration of KCI or other salt and/or other substances. 

pre-drug = drug candidate = chemical that is tested in order to 
1 5 determine whether it selectively affects a particular complex associated with a 
disease; 

SDS-PAGE = polyacrylamide gel electrophoresis in the presence of 
sodium dodecyl sulphate; 

M = molar (concentration); 
20 mM = millimolar; 

mcM = micromolar; 
ml = milliliter; 
mcl = microliter; 

rpm = revolutions per minute (during centrifugation); 
26 MBP s maltose binding peptide; 

SBP = streptavidin binding peptide; 
WB = washing buffer; 
ORF = open reading frame; 

contaminant protein = protein that binds nonspecifically to the protein 
30 of interest or to any protein or to the affinity matrix; and 

MALDI-TOF MS = matrix assisted laser desorption ionization / time of 
flight mass spectrometry. 
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As Stated above the term iigand complex means a biological complex 
containing one or more Iigand components. The ligands can be proteins, 
nucleic acids, lipids or carbohydrates. The ligands can be modified (e.g.* 
proteins can be glycoproteins or lipoproteins). 

The term "biological complex" means an association of ligands in a 
biological sample. The ligands can include proteins. DNA. RNA. 
carbohydrates and/or lipids present in a biological sample. 

The term "sample containing biological complexes" or "biological 
sample" as used herein means a cellular lysate. a cell lysate. a fraction of a 
cellular lysate or biological fluid that contains biological complexes. The 
sample can be from any animal, in particular humans, and may be biological 
fluids (such as blood, saliva or cerebrospinal fluid), tissue, hair or organs. In a 
preferred embodiment, the sample is a cell lysate. In one example, the cell 
lysate is obtained from a cell sample (e.g. culture of cells, tissue, or biological 
fluid) by using mechanical forces on the cell sample, such as grinding the 
frozen sample, glass beads beating, brief sonicatlon or homogenization. In 
addition, lysis buffer can be used, such as 2x Lysis Buffer (200mM KCI. 50mM 
HEPES pH 7.4. 20% glycerol. 2% DMSO. 0.2% TritonXlOO, 2mM PMSF, 
2mM benzamidine, 2x phosphatase inhibitors). It is advisable to keep the 
temperature between 0°C and 10°C, and maintain the pH optimally close to a 
physiological pH. 

The invention can be carried out with whole cells, cellular organelles, 
cellular lysate, or other biological fluids. In addition, the invention can be 
perfomied with fusion proteins and interacting ligands expressed in a cell free 

26 system. Cellular lysate can be fractionated in order to obtain a fraction that is 
enriched in the Iigand of interest. When the organism is a multicellular 
organism, the Immobilized fusion protein can originate from one tissue but the 
interacting ligands can originate from a different tissue (e.g. hormones). In this 
case, the biological fluid containing the interacting ligands can be applied to 

30 the fusion protein before or after the immobilization of the fusion protein. 
When the biological fluid containing the interacting ligands is applied after the 
immobilization of the fusion protein and removal of the unbound substances, 
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the Immobilized fusion protein can be associated with llgands from Its original 
tissue or , the associated ligands can be removed before applying the other 
biological fluid originating from a different tissue. The fusion protein can be 
cross-linked to the affinity matrix In order to minimize the leakage. 
5 The temri "immobilizing" as used herein means the second ligand is 

Immobilized to the affinity matrix either directly or Indirectly. For example, If 
the second ligand is a glycoprotein, then it can be immobilized directly to an 
affinity matrix coated with lectin (e.g. lectin coated beads). In other examples, 
the second ligand can be indirectly immobilized to the affinity matrix. For 

10 example, the second ligand can be joined to an affinity tag, such as. but not 
limited to. SpA. SpG. GST, SBP, SBP. poly-His or Flag, which can selectively 
bind to the respective affinity matrix. In addition, the second ligand can be 
indirectly Immobilized to the affinity matrix via antibodies specific for the 
second ligand. For example, if the anti-second ligand antibody is IgG type 

15 then the antibody can be immobilized to a Protein A Sepharose affinity matrix, 
and then the second ligand can be Indirectly Immobilized to the affinity matrix 
via the anti-second ligand specific antibody. In another example, the anti- 
second ligand antibody can be biotinylated, and then the biotinylated antibody 
can be immobilized to a streptavldin affinity matrix, and then the second 

20 ligand can be indirectly immobilized to the affinity matrix via the anti-second 
ligand specific antibody. 

The term "antibody" as used herein Is Intended to include monoclonal 
antibodies and polyclonal antibodies, antibody fragments (e.g. Fab and 
F(ab')2, and single chain antibodies (scFV)). and chimeric antibodies, which 

25 also specifically react with a ligand, such as the second ligand in the method 
of the invention. Antibodies can be fragmented using conventional techniques 
and the fragments screened for utility. For example. F(ab')2 fragments can be 
generated by treating antibody with pepsin. The resulting F(ab')2 fragment 
can be treated to reduce disulfide bridges to produce Fab' fragments. Papain 

30 digestion can lead to the fomiation of Fab fragments. Single chain antibodies 
combine the antigen-binding regions of an antibody on a single stably folded 
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polypeptide chain. Single chain antibodies can be generated by recombinant 
technology. 

Antibodies having specificity for a ligand may be prepared by 
conventional methods. A mammal, (e.g. a mouse, hamster, or rabbit) can be 
5 immunized with an immunogenic form of the peptide which elicits an antibody 
response In the mammal. Techniques for conferring Immunogeniclty on a 
peptide include conjugation to carriers or other techniques well known In the 
art. For example, the peptide can be administered in the presence of 
adjuvant. The progress of immunization can be monitored by detection of 

10 antibody titers in plasma or serum. Standard ELISA or other immunoassay 
procedures can be used with the immunogen as antigen to assess the levels 
of antibodies. Following immunization, antisera can be obtained and. if 
desired, polyclonal antibodies Isolated from the sera. 

To produce monoclonal antibodies, antibody producing cells 

15 (lymphocytes) can be harvested from an immunized animal and fused with 
myeloma cells by standard somatic cell fusion procedures thus immortalizing 
these cells and yielding hybridoma cells. Such techniques are well known in 
the art, (e.g. the hybridoma technique originally developed by Kohler and 
Mllstein (Nature 256:495-497 (1 975)) as well as other techniques such as the 

20 human B-cell hybridoma technique (Kozbor et al., Immunol.Today 4:72 
(1983)), the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole et al.. Methods Enzymol, 121:140-67 (1986)), and screening 
of combinatorial antibody libraries (Huse et al., Science 2^6:1275 (1989)). 
Hybridoma cells can be screened Immunochemically for production of 

25 antibodies specifically reactive with the peptide and the monoclonal antibodies 
can be isolated. 

Specific antibodies, or antibody fragments, reactive against ligand 
components may also be generated by screening expression libraries 
encoding immunoglobulin genes, or portions thereof, expressed in bacteria 
30 with cell surface components. For example, complete Fab fragments, VH 
• regions and FV regions can be expressed in bacteria using phage expression 
libraries (See for example Ward et al.. Nature 341:544-548 (1989); Huse et 
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al., Science 246:1275-1281 (1989); and McCafferty et al., Nature 348:552-554 
(1990)). Alternatively, a SCID-hu mouse, for example the model developed 
by Genpharm. can be used to produce antibodies, or fragments thereof. 

"Joining the second ligand to an affinity tag" as used herein means the 
5 affinity tag is joined to the second ligand directly or indirectly. For example, 
the second ligand can be joined to the affinity tag indirectly via a linker, such 
as a peptide linker, a chemical linker, a nucleic acid linker or other 
biomolecule linker. In another example, the affinity tag can be joined to the 
second ligand directly. The fusion ligand, which is the second ligand joined 

10 directly or indirectly to an affinity tag, can be made using recombinant 
technology. Methods for purifying and manipulating recombinant proteins 
including fusion proteins are well known in the art. Necessary techniques are 
explained fully in the literature, such as, "Molecular Cloning: A Laboratory 
Manual", second edition (Sambrook et al.. 1989); "Oligonucleotide Synthesis" 

15 (M. J. Gait. ed.. 1984); "Animal Cell Culture" (R. I. Freshney, ed.. 1987); 
"Methods in Enzymology" (Academic Press. Inc.); "Handbook of Experimental 
Immunology" (D. M. Weir & C. C. Blackwell. eds.); "Gene Transfer Vectors for 
Mammalian Cells" (J. M. Miller & M. P. Calos. eds., 1987); "Current Protocols 
in Molecular Biology" (F. M. Ausubel et al., eds., 1987); "PCR: The 

20 Polymerase Chain Reaction". (Mullis et al., eds., 1994); "Current Protocols in 
Immunology" (J. E. Coligan etal., eds., 1991). 

« 

The term "removing unbound substances from the affinity matrix" as 
used herein means washing the affinity matrix with the immobilized second 
ligand bound in a manner that removes unbound substances from the affinity 
25 matrix while leaving the immobilized second ligand bound to the affinity matrix 
and the first ligand still associated with the second ligand. In one example, the 
wash buffer is lOOmM KCI, lOmM HEPES pH 7.4, 5% glycerol. 1% DMSO. 
0.1%TritonX100. 

The term "separating the first ligand from the immobilized second 
30 ligand" as used herein means separating the first ligand from the immobilized 
second ligand in a manner that removes the first ligand from the second 
ligand while leaving the immobilized second ligand bound to the affinity 
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matrix. The first llgand can be separated from the immobilized second ligand 
by disrupting the electrostatic forces between the first ligand and the 
immobilized second ligand. The electrostatic forces can be disrupted by using 
an elution buffer that increases the Ionic strength of the medium. In one 
5 example, the ionic strength of the medium is increased using an elution buffer 
with an increased salt concentration, such as 0.3-0.5M potassium chloride 
(KCI) or 0.7M potassium acetate. One of skill in the art will appreciate that 
any salt that can increase the ionic strength of the medium can be used in the 
method of the invention. Examples of salts that may be used include, without 
10 being limiting, potassium chloride, sodium chloride, potassium acetate and 
sodium acetate. 

Another variation of separating the first ligand from the immobilized 
second ligand in a manner that removes the first ligand from the second 
ligand while leaving the immobilized second ligand bound to the affinity matrix 

15 Includes changing the pH of the system or enzymatic treatment that modifies 
the first ligand and/or second ligand. 

A person skilled in the art will appreciate that any pH can be used to 
separate the first ligand from the immobilized second ligand. as long as it 
does not disrupt the bond between the affinity matrix and the Immobilized 

20 second ligand. In one example, a pH range between 4 and 10 can be used to 
separate the first ligand from the immobilized second ligand. In another 
example. pH 4 can be used for separafing the first ligand, which does not 
affiect the IgG-SpA bond or IgGiSpG bond. 

A person skilled in the art will also realize that different enzymatic 

25 treatments can be used. For example, non-specific phosphatases (alkaline 
and acid) can hydrolyze almost any phosphate ester, such as CIP, which is an 
alkaline phosphatase of calf intestines that catalyzes the non-specific 
hydrolysis of phosphate monoesters. Another example is serine/threonine 
protein phosphatases. These are relatively non-specific and can be used for 

30 treatment of a wide range of proteins. New England Biolabs offers YOP 
Protein Tyrosine Phosphatase (coded by Yop51 gene of Yersinia 
enterocolitica), which Is a phospho tyrosine-specific protein phosphatase. By 
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treating the immobilized complex with different specific phosphatases, one 
can determine which phosphorylated amino acid(s) take part in the formation 
of the salt bonds between the first and second ligand. 

It is a variation of the method of the invention to add in the elution 
buffer substances that facilitate the separation of the interacting proteins from 
the protein of interest (e.g. 0.5M urea or 0.5M guanidinium chloride or SDS 
below 0.1% or TritonXlOO below 1%). but that do not disrupt the bond 
between the affinity tag and the affinity matrix. 

Elution can be performed by gradually increasing the concentration of 
the elution agent (e.g. salt gradient, pH gradient, enzyme concentration). A 
person skilled in the art will appreciate that as the concentration or strength of 
the elution agent increases, the possibility of the immobilized complex 
separating into the liquid phase increases. For example. Figure 12 shows that 
during high salt elution trace amounts of Rpbl, Rpb2 and Rpb3 were 
15 separated into the liquid phase, which did not interfere with the successful 
identification of the interacting proteins. Different variations of elution agents 
can be assessed to eliminate or avoid leakage of the immobilized complex 
when separating the first ligand from the immobilized second ligand. 

Another variant is to fractionate further the ligands in the eluate by 
20 chromatography (e.g. liquid chromatography) or other separating techniques. 
However, the fractionation may lead to unnecessary dilution and/or 
distribution of an interacting ligand in several fractions. In this case, all the 
fractions that do not contain the affinity tagged ligand. or contain only 
insignificant amount, can be recombined. 

The elution agent might disrupt partially and nonselectively the binding 
between the affinity tag and the affinity matrix (especially if some affinity tags 
are partially denatured during the preparation of the cellular lysate) and. as a 
result, a certain amount of the immobilized fusion ligand can detach from the 
affinity matrix and co-elute with the interacting ligands. The problem can be 
30 solved in several ways: 

(i) The affinity matrix can be pre-treated under the same conditions as 
the elution conditions in order to wash away any ligand or affinity material. 
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such as IgG. Calmodulin. Amylose etc. that Is not attached strongly enough to 
the solid support (e.g. agarose or sepharose); 

(ii) The highest concentration of the elution agent that does not cause 
separation of the fusion ligand fi-om the affinity matrix can be detennined in a 
pilot experlment(s) using the same affinity tag fused to a standard (control) 
protein. Different fractions can be analyzed by immunodetection (with 
antibody against the affinity tag or against the standard protein) or other 
techniques in order to determine the presence or absence of the affinity tag 
and/or affinity tagged standard ligand. The elution conditions that do not lead 
to separation of the affinity tagged standard ligand from the affinity matrix 
should be used to perform the method of the Invention with the ligand of 
Interest; 

(iii) The presence of an interacting ligand In a fraction that Is eluted 
before the first fraction that contains the fusion ligand is sufficient to prove that 
the interacting ligand is Isolated and/or detected by separating it from the 
immobilized fusion ligand; and 

(iv) If the fusion ligand leaks together with the affinity matrix (e.g. IgG). 
this is a problem with the affinity matrix. 

In one example, the affinity matrix is in the form of beads and can be 
packed into a column so that the removal of the unbound proteins and other 
substances can be achieved by washing the column. The affinity tag binds 
the affinity matrix by predominantly hydrophobic forces and the elution of the 
Interacting ligands is earned out by increasing the ionic strength and/or the 
dielectric constant of the medium. The elution is performed under conditions 
that do not weaken the binding between the affinity tag and the affinity matrix 
or weaken them to a lesser extent than the binding between the ftjslon protein 
and the interacting proteins and/or other substances. Elution by ionic strength 
equivalent to 0.35M KCI for 20 minutes is a good starting point for studying an 
uncharacterized ligand complex. 

The method of the invention is suitable for detecting transient protein- 
protein Interactions, and other transient interactions that involve proteins 
Including but not limited to DNA-protein interactions. RNA-protein interactions' 
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carbohydrate-protein interactions, and iipid-protein interactions. The method 
of the invention is also suitable for detecting transient ligand-ligand 
interactions, where the ligand Includes proteins. DNA. RNA. lipids and/or 
carbohydrates. For example, the method of the invention can be used to 
detect transient DNA-RNA interactions. DNA-DNA interactions. DNA-lipid 
interactions, DNA-carbohydrate interactions. RNA-RNA interactions, RNA- 
lipid interactions, RNA-carbohydrate Interactions, lipid-lipid interactions, llpid- 
carbohydrate interactions and carbohydrate-carbohydrate interactions. 

The invention contemplates the second ligand being a recombinant 
protein that contains at least one affinity tag. This fusion protein may be 
prepared using recombinant DNA methods. Accordingly, a nucleic acid 
molecule encoding the fusion may be incorporated in a known manner into an 
appropriate expression vector which ensures good expression of the fusion 
protein. Possible expression vectors include but are not limited to cosmlds, 
plasmids, or modified viruses (e.g. replication defective retroviruses, 
adenoviruses and adeno-assoclated viruses), so long as the vector is 
compatible with the host cell used. The expression vectors are "suitable for 
transfonnation of a host cell", which means that the expression vectors 
contain a nucleic acid molecule encoding the fusion protein and regulatory 
20 sequences selected on the basis of the host cells to be used for expression, 
which is operatively linked to the nucleic acid molecule. Operatlvely linked is 
intended to mean that the nucleic acid Is linked to regulatory sequences in a 
manner which allows expression of the nucleic acid. 

Suitable regulatory sequences may be derived from a variety of 
sources, including bacterial, fungal, viral, mammalian, or insect genes (For 
example, see the regulatory sequences described in Goeddel. Gene 
Expression Technology: Methods in Enzymology 185. Academic Press. San 
Diego. CA (1990)). Selection of appropriate regulatory sequences is 
dependent on the host cell chosen as discussed below, and may be readily 
accomplished by one of ordinary skill in the art. Examples of such regulatory 
sequences include: a transcriptional promoter and enhancer or RNA 
polymerase binding sequence, a ribosomal binding sequence, including a 
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translation Initiation signal. Additionally, depending on the host cell chosen 
and the vector employed, other sequences, such as an origin of replication, 
additional DNA restriction sites, enhancers, and sequences conferring 
inducibility of transcription may be Incorporated into the expression vector. It 
will also be appreciated that the necessary regulatory sequences may be 
supplied by the native protein of interest. 

The recombinant expression vectors of the invention may also contain 
a selectable marker gene which facilitates the selection of host cells 
transformed or transfected. Examples of selectable marker genes are genes 
encoding a protein such as G418 and hygromycin which confer resistance to 
certain drugs, P-galactosldase. chloramphenicol acetyltransferase, firefly 
luciferase. or an immunoglobulin or portion thereof such as the Fc portion of 
an immunoglobulin preferably IgG. Transcription of the selectable marker 
gene is monitored by changes in the concentration of the selectable marker 
protein such as P-galactosidase. chloramphenicol acetyltransferase, or firefly 
luciferase. If the selectable marker gene encodes a protein conferring 
antibiotic resistance such as neomycin resistance transformant cells can be 
selected with G418. Cells that have incorporated the selectable marker gene 
will survive, while the other cells die. This makes it possible to visualize and 
assay for expression of the recombinant expression vectors. It will be 
appreciated that selectable markers can be introduced on a separate vector 
from the recombinant molecule. 

The recombinant expression vectors encoding the fusion protein 
contain genes which encode at least one affinity tag fused In frame to the 
gene encoding the protein of interest. Typical fusion expression vectors 
include pGEX (Amrad Corp.. Melbourne. Australia). pMal (New England 
Biolabs. Beverly. MA) and pRIT5 (Pharmacia, Piscataway. NJ) which fuse 
glutathione S-transferase (GST), maltose E binding protein, or protein A, 
respectively, to the recombinant protein. The affinity tag may be fused directly 
or indirectly, such as through a peptide linker, to the recombinant protein. 

Recombinant expression vectors can be introduced into host cells to 
produce a transformed host cell. The temis "transformed with", "transfected 
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with". "transformation" and "transfection" are intended to encompass 
introduction of nucleic acid (e.g. a vector) Into a cell by one of many possible 
techniques known in the art. Prokaryotic cells can be transformed with 
nucleic acid by. for example, electroporation or calcium-chloride mediated 
6 transfomiation. For example, nucleic acid can be introduced into mammalian 
cells via conventional techniques such as calcium phosphate or calcium 
chloride co-precipitation. DEAE-dextran mediated transfection. lipofectin, 
electroporation or microinjection. Suitable methods for transforming and 
transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A 
Laboratory Manual. 2nd Edition, Cold Spring Harbor Laboratory press (1989)). 
and other laboratory textbooks. 

Suitable host cells include a wide variety of eukaryotic host cells and 
prokaryotic cells. For example, the fusion protein may be expressed in yeast 
cells or mammalian cells. Other suitable host cells can be found in Goeddel. 
Gene Expression Technology: Methods in Enzymology 185. Academic Press,' 
San Diego. CA (1991). In addition, the fusion protein may be expressed in 
prokaryotic cells, such as Escherichia coli (Zhang et al., Science 303(5656)- 
371-3 (2004)). 

Yeast and fungi host cells suitable for carrying out the present 
Invention include, but are not limited to Saccharomyces cemvisiae, the genera 
Pichia or Kluyveromyces and various species of the genus Aspergillus. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSed 
(Baldari. et al., Embo J. 6:229-234 (1987)). pMFa (Kurjan and Herskowitz 
Cell 30:933-943 (1982)). pJRY88 (Schultz et al.. Gene 54:113-123 (1987)), 
and PYES2 (Invitrogen Corporation. San Diego. CA). Protocols for the 
transformation of yeast and fungi are well known to those of ordinary skill in 
the art (see Hinnen et al.. Proc. Natl. Acad. Sci. USA 75:1929 (1978); Itoh et 
al.. J. Bacteriology 153:163 (1983). and Cullen et al. (BlolTechnology 5 369 
(1987)). 

Mammalian cells suitable for carrying out the present invention include, 
among others: COS (e.g.. ATCC No. CRL 1650 or 1651). BHK (e g ATCC 
No. CRL 6281). CHO (ATCC No. CCL 61). HeLa (e.g.. ATCC No. CCL 2) 
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293 (ATCC No. 1573) and NS-1 cells. Suitable expression vectors for 
directing expression in mammalian cells generally Include a promoter (e.g.. 
derived from viral material such as polyoma. Adenovirus 2. cytomegalovirus 
and Simian Virus 40), as well as other transcriptional and translational control 
sequences. Examples of mammalian expression vectors include pCDM8 
(Seed. B.. Nature 329:840 (1987)) and p!VlT2PC (Kaufman et al.. ElVlBO J. 
6:187-195 (1987)). 

Given the teachings provided herein, promoters, terminators, and 
methods for introducing expression vectors of an appropriate type into plant, 
avian, and insect cells may also be readily accomplished. For example, within 
one embodiment, the fusion protein can be expressed in plant cells (see 
Sinkar et al.. J. Biosci (Bangalore) 11:47-58 (1987), which reviews the use of 
Agrobacterium rhizogenes vectors; see also Zambryski et al.. Genetic 
Engineering, Principles and Methods. Hollaender and Setlow (eds.), Vol. VI. 
15 pp. 253-278. Plenum Press. New York (1984), which describes the use of 
expression vectors for plant cells, including, among others, PAPS2022, 
PAPS2023, and PAPS2034). 

Insect cells suitable for carrying out the present invention include cells 
and cell lines fi-om Bombyx, Trichoplusia or Spodotera species. Baculovirus 

20 vectors available for expression of proteins in cultured insect cells (SF 9 cells) 
include the pAc series (Smith et al., Mol. Cell Biol. 3:2156-2165 (1983)) and 
the pVL series (Lucklow, V.A., and Summers, M.D., Virology 170:31-39 
(1989)). Some baculovirus-lnsect cell expression systems suitable for 
expression of the fusion protein are described in PCT/US/02442. 

25 Alternatively, the fusion protein may also be expressed in non-human 

transgenic animals such as rats, rabbits, sheep and pigs (Hammer et al. 
Nature 315:680-683 (1985); Palmiter et al. Science 222:809-814 (1983); 
Brinster et al. Proc. Natl. Acad. Sci. USA 82:4438-1442 (1985); Palmiter and 
Brinster Cell 41 :343-345 (1985) and U.S. Patent No. 4.736.866). 

^° embodiment, the fusion protein is expressed 

extrachromosomally and, optionally, the chromosomal copies of the gene (i.e. 
native gene encoding protein of interest) can be silenced. In another example. 
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chromosomal copies of the gene of interest can be replaced by homologous 
recombination. Standard DNA recombinant techniques and yeast 
manipulation techniques are described in Sambrook, Fritsch, Maniatis, 1982 
Molecular Cloning, Cold Spring Harbor Laboratory Press, and Current 
5 Protocols in Molecular Biology, John Wiley and Sons, Inc, New York, 1994. 

Although the construction of affinity tags and expressing fusion proteins 
is not the essence of the present invention, several related issues deserve 
attention. When the organism is diploid, it is best to modify both chromosomal 
copies of the same gene. The affinity tag can be joined at the N-temiinus or 

10 inside the protein or, preferably, at the C-temiinus in order to avoid interfering 
with the expression from the natural promoter. Two different tags can be 
attached to two different subunits of the protein complex. Two different tags 
can be attached to either end of the same protein or to the same end of the 
protein. One tag can be attached to one end of the protein and another 

15 different tag can be Inserted inside the gene. The binding domains of the 
affinity tag can be fused to the protein of Interest directly or a by a linker 
sequence. The fusion protein can contain the whole protein of interest or only 
a part of it. It is best if the affinity tag is fused in such a way that it is 
accessible (i.e. can bind easily to the respective ligand) and does not affect 

20 the expression and folding of the protein and the assembly of a natural protein 
complex. In addition, the affinity tag should not interfere with the protein 
interactions and should be placed outside the interface. Since different 
organisms prefer different codons for the same amino acid, the sequences of 
the affinity tags can be modified to ensure good expression. 

25 A person skilled in the art will appreciate that the recombinant nucleic 

acid expressing the fusion protein contains sequences ensuring its proper 
transcription, processing and translation. Preferably, the fusion protein is 
expressed under physiological conditions enabling its associations with other 
proteins or biomolecules in vivo. 

30 in addition to the entire protein of Interest, it will be appreciated that the 

fusion protein may contain only a portion of the protein of interest. The 
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truncated or fragmented protein of interest may be prepared by expression of 
a truncated gene. 

The scale of the experiment (number of the cells and the amount of 
total protein) depends on several factors: (a) the copy numbers of the ligand 
5 of interest and the interacting ligands.. (b) the binding equilibrium constants of 
particular ligand interactions, (c) the sensitivity of the method that is used to 
identify the interacting ligands. By performing the method of the invention with 
different tagged subunits and with approximately 1011 cells, nearly all 
reported interacting proteins for RNA Polymerase II (approximately 30.000 
copies per cell) were detected by mass spectrometry. The copy number of the 
protein of interest can be detennined by Western blotting and the scale of the 
experiment can be recalculated. 

When the protein of interest is present in low copy number in the cell, it 
can be expressed from a different promoter that generates more copies of the 
protein. Optionally, the promoter can be inducible and if the protein of interest 
is a subunit of a stable protein complex, the other subunits and the fusion 
protein of interest can be expressed from identical inducible promoters in 
order to ensure proper stoichiometries of the stable protein complex. In this 
case, great care should be taken not to over-express the protein of interest to 
20 a level that leads to fonnation of inclusion bodies. 

The fusion protein can originate from one organism and the interacting 
proteins can originate from a different organism (e.g. host organism and 
parasite organism). The affinity tagged protein of interest can be exogenous 
or endogenous. However, the invention gives the most valid results when the 
fusion protein and the interacting proteins are expressed in their endogenous 
organism and their expression is driven from their natural promoters. 

As stated above, the invention Includes usage of affinity tagged 
proteins. It should be noted that the invention not only differs from other 
methods for purification via affinity tag but is even quite opposite to them (i.e. 
30 the greatest care must be taken to avoid the separation of the fusion protein 
from the affinity matrix during the elution). The novelty of the invention is 
apparent by comparison with the other methods for affinity purification as 
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described by Bauer A. and Kuster B. in Eur. J. Biochem. Feb 2003, or. by 
comparison with the other methods for detection and analysis of protein 
interactions as described in a review by Phizicl^y E. and Fields S. in 
Microbiological Reviews, Mar. 1995, Vol. 95, No. 1. 

Moreover, all basic methods for protein purification (I.e. affinity 
chromatography, hydrophobic interactions, ion exchange, reversed phase) 
include separation of the protein that has been bound to the solid phase. The 
invention offers a convenient way for separating substoichiometrically 
associated proteins and/or other biomolecules from the immobilized protein 
and constitutes a novel basic method for purifying proteins and detecting 
protein interactions. 

The invention can also be described as immobilizing selectively a 
complex containing the ligand of interest on a solid phase by binding between 
the ligand of interest and the solid phase and, after removing the unbound 

1 5 substances, separating the transiently associated ligand from the immobilized 
ligand of interest. The ligand of interest likely binds to the solid phase by 
domlnantly one type of force (preferably, hydrophobic force) and elution of the 
transient ligand should be perfomned by weakening another force. As shown 
in the examples of Rpbl fused to a SpA-tag (see Figure 3, 4 and 6), the 

20 ligand of interest (i.e. the fusion protein) binds to the affinity matrix (i.e. IgG 
beads) by hydrophobic force (i.e. Fc reactivity) and the transient ligand (i.e. 
transcription factors) are separated by weakening the electrostatic bonds (i.e. 
increasing the ionic strength of the medium). 

As mentioned above, the method of the invention is not limited to 

25 separating, and optionally analyzing, protein-protein interactions. For 
example, a nucleoprotein complex can be immobilized by binding between its 
nucleic acid component and an affinity matrix coated with complementary 
sequence. The nucleic acid component of the complex can be genetically 
engineered so that it contains a Poly(G) and the complex can be immobilized 

30 on Poly(C) matrix (Sigma). The associated proteins can be eluted with 
increasing the ionic strength of the medium. In this case, the ligand of interest 
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is bound to the affinity matrix by hydrogen bonds and the interacting proteins 
are separated by increasing the ionic strength of the medium. 

In another example, separation of the transient protein interactor (first 
ligand) from the immobilized protein or protein complex (second ligand) can 
5 be achieved by enzymatic treatment (e.g. treatment that removes or adds an 
electrostatic charge as a result of acetylation or deacetylation or 
phosphorylation or dephosphorylation). In this case, essential information can 
be obtained about the nature of a particular protein-protein interaction. 

The method of the invention is especially suitable for detecting proteins 

1 0 that interact with a multicomponent permanent protein complex. It is best if the 

elution is performed under such conditions that all subunits of the permanent 

protein complex remain substantially immobilized because a significant 

leakage into the liquid phase of any subunit. not only the tagged one, can 

suppress the identification and analysis of the substoichiometrically interacting 
1 5 proteins. 

In a further example, the Invention relates to methods of separating a 
first ligand from a second ligand in an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand, wherein the first ligand and 
the second ligand associate with each other by electrostatic forces, and 
20 wherein the second ligand is a recombinant fusion protein containing at least 
two different affinity tags, comprising the steps: 

(a) introducing into a cell or organism a recombinant nucleic acid 
molecule encoding a fusion protein comprising the second ligand 
fused to at least two different affinity tags that can selectively bind 

25 to different affinity matrixes; 

(b) expressing the fusion protein; 

(c) obtaining a sample containing biological complexes that include the 
first ligand and the fusion protein; 

(d) immobilizing the fusion protein on a first affinity matrix via a first 
30 affinity tag; 

(e) removing unbound substances from the first affinity matrix; 

(f) separating the fusion protein from the first affinity matrix; 
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(g) immobilizing the fusion protein on a second affinity matrix via a 
second affinity tag. wliich is different than the first affinity tag; 

(h) removing unbound substances from the second affinity matrix; 

(i) separating the first ligand from the Immobilized fusion protein, which 
remains bound to the second affinity matrix during the separation, 
by decreasing the electrostatic forces between the first ligand and 
the fusion protein; and 

G) optionally, analyzing the separated first ligand. 
In another example, the invention relates to methods of separating a 
first ligand from a second ligand In an affinity matrix system, wherein the first 
ligand associates in vivo with the second ligand. wherein the first ligand and 
the second ligand associate with each other by electrostatic forces, and 
wherein the second ligand is a protein complex containing two or more 
subunits of which are fused to different affinity tags, comprising the steps: 

(a) introducing into a cell or organism recombinant nucleic acids 
molecules encoding fusion proteins comprising the two or more 
subunits of which are fused to different affinity tags that can 
selectively bind to different affinity matrixes; 

(b) expressing the fusion proteins; 

(c) obtaining a sample containing biological complexes that include the 
first ligand and the fusion proteins; 

(d) immobilizing the fusion proteins on a first affinity matrix via a first 
affinity tag; 

(e) removing unbound substances from the first affinity matrix; 
25 (f) separating the fusion proteins from the first affinity matrix; 

(g) immobilizing the fusion proteins on a second affinity matrix via 
second affinity tag. which is different than the first affinity tag; 

(h) removing unbound substances from the second affinity matrix; 

(i) separating the first ligand from the immobilized fusion proteins. 
30 which remain bound to the second affinity matrix during the 

separation, by decreasing the electrostatic forces between the first 
ligand and the fusion proteins; and 



20 



wo 2005/016956 



PCT/CA2004/001448 



-35- 

(j) optionally, analyzing the separated first ilgand. 
This two-step affinity purification using two different affinity tags fused 
to the ligand of interest or fused to two subunits of a ligand complex reduces 
the false positives by reducing the background fi^om the contaminant proteins. 
5 In one example, the dual tags are the TAP (tandem affinity purification) tags 
containing a calmodulin binding domain and IgG binding domain separated by 
a Tev cleavage site. Other dual tags include, but are not limited to. a 
combination of GST and SpA, a combination of GST and SpG, a combination 
of GST and poly-His, a combination of MBP and SpA, a combination of MBP 
10 and SpG, a combination of MBP and poly-His, a combination of SBP and 
SpG, a combination of SBP and SpA, a combination of MBP and HA, and a 
combination of SBP and poly-His. A person skilled in the art will appreciate 
that different tags and different combinations of tags can be used. 

Purification of interacting ligands by fusing two affinity tags to the 
15 protein of interest or to subunits of the protein complex of interest can be 
described as follows: (a) binding the recombinant protein(s) by the first tag to 
the first affinity matrix, (b) washing away the unbound proteins, (c) separating 
the fusion protein(s) from the first affinity matrix, (d) binding the fusion 
protein(s) to the second affinity matrix, (e) washing away the residual 
20 unbound proteins, (f) separating the interacting ligand from the fusion 
protein(s), which remains bound to the affinity matrix. 

An affinity tag that remains bound to the respective affinity matrix at 
ionic strength equivalent to 0.4-0.6M KCI is suitable for carrying out the 
invention when the elution is performed by increasing the ionic strength. 
25 Affinity tags genetically derived from Streptococcal protein G (SpG) or from 
Protein A fl'om Staphylococcus aureus (SpA) bind strongly to IgG-sepharose 
and are suitable affinity tags for carrying out the invention. 

GST:glutathione binding is not affected by ionic strength equivalent to 
0.5 M KCI and thus the GST-tag a good candidate for a single tag for carrying 
30 out the invention or a second tag when a dual affinity tag is used. 

On the other hand, an affinity tag that can be separated from the 
respective ligand under conditions that do not disrupt the transient protein 
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Interactlons can be used as a first tag. For example. GST-tag can be 
separated from glutathione beads by treatment with 5-20 mM reduced 
glutathione. Flag-tag can be separated by adding Flag-tag (DYKDDDDK) and 
can be used as a first tag. 

5 MBP can be separated by treatment with 1 0mM maltose and is suitable 

for a first tag. MBP:maltose binding is not affected by ionic strength that is 
equivalent to 0.7-1 M KCI and is suitable for a second tag or as a single tag. 

Poly-His tag requires stringent conditions for release of the Immobilized 
protein and is not suitable for a first tag. It can be used as a second tag or as 
10 a single tag because the tag remains bound to Ni2+ column at 0.5M KCI. 

The Myc tag and hemagglutinin tag (HA) are suitable for a first tag. The 
HA-tag is more suitable for a second tag or single tag because it contains only 
two electrostatically charged amino acids. 

Other tags that can be used for carrying out the invention are the T7- 
15 Tag sequence (the initial llaa of the T7 gene). VSV-G (15 AA C-terminal 
peptide of VSV-G), b-GAL, Thioredoxin. The list illustrates but does not limit 
the scope of the invention. The review of K. Terpe in Appl. Microbiol. 
Biotechnol. 2003) describes popular affinity tags. 

When the protein of interest is present in the cell in low copy number 
20 and an additional purification step is needed before the separation of the 
interacting ligands from the fusion protein, a combination of two different 
affinity tags can be used. It is best if the second immobilization is performed 
by the stronger tag. i.e. the dissociation constant (Kd) for the binding 
tag:ligand is smaller. In order to avoid leakage of the affinity tagged protein 
25 during the elution. Separation from the first solid support can be performed by 
proteolytic cleavage or by a treatment with a chemical agent. 

The methods of the invention facilitate the identification and analysis of 
the first ligand that is associated with the second ligand in substoichlometric 
amounts. For example, the method of the invention solves the dynamic range 
30 problem of mass spectrometry, because the identification of the low 
abundance proteins (I.e. the first ligand) by mass spectrometry is facilitated if 
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they are analyzed separately from the high abundance ones (I.e. second 
ligand). 

In one example, naturally assembled RNA Polymerase II complexes 
can be immobilized on IgG-beads via a SpA-tagged subunit of the core 
5 complex. Then, the immobilized complexes can be treated with elution buffer 
containing 0.4 M KCI. This method separates the transcription factors into the 
liquid phase but the core complex remains immobilized on the IgG beads. The 
analysis of the transcription factors by mass spectrometry is greatly improved 
by the separation. 

10 The inventors found that increasing the ionic strength does not disrupt 

either the bonds between the second ligand and the matrix (e.g. affinity 
tagged second ligand bound to the affinity matrix, such as a SpA:lg bond) or 
the bonds between the subunits of permanent ligand complexes (e.g. the 
RNA Polymerase II core complex) but leads to rapid and almost complete 

15 dissociation of transient ligand complexes (e.g. transcription factors). Since 
the bonds between the second ligand and the affinity matrix are mainly 
hydrophobic in nature, the same must be the case for binding between the 
subunits of pemianent ligand complexes (i.e. RNA Polymerase II). While, 
transient interactions between the transient ligand complexes are due in large 

20 part to electrostatic interactions, as demonstrated in the examples below. 

Without being limited to theory, increasing the ionic strength 
destabilizes the electrostatic attraction between charged groups (salt bonds) 
because of the Debye-Huckel screening and according to Coulomb's law: F = 
q1q2/Dr2. where: F - strength of the electrostatic force, q - charge of the ion; 

25 D - effective dielectric constant of the media; r - distance between the ions. 
On the other hand. Increasing the salt concentration stabilizes the 
hydrophobic bonds by competing out the water molecules from the non-polar 
patches of the protein surface so that they associate even stronger by 
hydrophobic interaction. This increase In the entropy of water molecules, as 

30 the number of them solvating hydrophobic surfaces decreases, is the main 
driving force for protein precipitation by "salting out" techniques (e.g. 
ammonium sulphate precipitation). 
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In addition, witliout being limited to tlieory. the fact that more than 60 
interacting proteins can be dissociated from the RNA Polymerase II core 
complex with 12 subunits by weakening the electrostatic bonds (i.e. by 
increasing the ionic strength of the medium) allows the formulation of a 
5 general rule: "Permanent protein complexes are held together mainly by 
hydrophobic forces and transient ligand complexes are held together in large 
part by electrostatic attractions (mainly salt bonds between the charged amino 
acids)". The salt bonds between the subunits of permanent ligand complexes 
are not affected by increasing the ionic strength because the ions cannot 

1 0 reach into the hydrophobic interface between the subunits and weaken them. 
The salt bonds that are formed during a transient ligand-llgand interaction are 
not shielded by the hydrophobic interface and they are easily disrupted by 
increasing the ionic strength of the medium. The other electrostatic 
attractions, which are much weaker, i.e. London forces, dipole-dipole. charge- 

15 dipole. are also weakened by increasing the ionic strength. The general rule 
above Is a general rule and allows for some minor exceptions. 

Jones, S and Thornton. J., PNAS 1996 showed by statistical analysis 
that the contact surfaces (interfaces) of transient protein complexes contain 
more hydrophilic residues than the contact surfaces of permanent protein 

20 complexes. The inventors also demonstrated that among all hydrophilic 
residues, the most important for formation of transient protein complexes are 
electrostatically charged ones (e.g. Lysine (positive), Arginine (positive). 
Aspartate (negative), Glutamate (negative) and partially Histidine (50% 
positively charged at pH 6.0)) that can form salt bonds or other electrostatic 

25 bonds. Another group of amino acids that are involved in transient protein- 
protein interactions are the ones that are electrostatically charged as a result 
of post-translational modifications (e.g. phosphorylation, acetylation). 

Archakov et al. proposed, based on a statistical analysis (Archakov, A. 
et al. Proteomics, 2003), that formation of permanent complexes resembles 

30 and even might be a continuation of protein core folding, and henceforth, is 
due to the hydrophobic force. The inventors' results support this point of view 
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and further suggest that stable protein complexes may have a common 
hydrophobic Interior that Is not accessible to the Ions. 

The nature of the transient protein-protein Interactions was unknown 
prior to the Inventors' findings. The Inventors have found that transient proteln- 
5 protein ihteractions are predominantly electrostatic. Moreover, since among 
all electrostatic forces charge-charge Interactions (Coulomb force) are the 
strongest ones, they play a major role in transient protein Interactions. 

Without being limited by theory, three interesting features of the 
Coulomb force explain the inventors* finding that they are responsible for the 
1 0 formation of the majority of the transient protein complexes: 

(I) Coulomb forces are long-range forces and they still play a role at 1 
nanometer distance. All other forces are short-range forces, e.g. London 
forces occur when the molecules are close enough to induce electrostatic 
dipoles in one another. 

15 (II) Coulomb forces are the strongest ones among noncovalent forces 

(up to -80 kcal/mol for salt bonds). The other noncovalent bonds are much 
weaker. Hydrogen bonds are -3 to -6 kcal/mol, Van der Waals bonds are -0.5 
to -1 kcal/mol and hydrophobic bonds are -0.5 to -3 kcal/mol. 

(Ill) Coulomb forces can be either attractive or repulsive, depending on 

20 the nature of the charges involved, I.e. attractive between positive and 
negative charges, and repulsive between two positive charges or between two 
negative charges. 

Without being limited by theory, the Inventors believe that the formation 
and disintegration of transient protein complexes Is due to alternation of 

25 electrostatic attraction and electrostatic repulsion between the electrostatically 
charged amino acids. The switch could reflect the accomplishment of the 
function that requires formation of the transient protein complex and the three 
most plausible causes for the switch are: (a) change of orientation and/or 
position of the Interacting proteins relative to one another, and/or (b) change 

30 of the conformation of one or both of the Interacting proteins and/or (c) adding 
or removing an electrostatic charge as a result of post translational 
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modification. e.g. acetyl group to Lysine or phosphate group to Serine, 
Tyrosine and Threonine. 

Whatever the exact mechanism of transient protein-protein 
Interactions. It seems clear that electrostatic forces are of primary importance. 
5 The invention is designed to detect transient protein-protein Interactions by 
wealcening the electrostatic forces and separating the interacting protein from 
the immobilized ligand of interest. 

The Invention has several significant advantages comparing to other 
methods for protein separation and/or detecting protein interactions. 

10 (1) The method of the invention does not require a purified protein of 

Interest for the preparation of affinity column, 

(II) The immobilized protein or protein complex Is in its natural 
modification state (e.g. phosphorylation state), which Is very often necessary 
for proper formation of protein complexes, 

16 (III) Since many proteins exist and Interact as components of protein 

complexes, but not as single proteins, the invention is suitable for detecting 
transient protein Interactions that involve multicomponent protein complexes. 

(IV) The method of the invention detects proteins that have been 
associated with the fusion protein in vivo. Protein affinity chromatography 

20 according to Formosa T., Methods Enzymol. 1991, and GST pull downs are 
based on in vitro (de novo) association of the proteins with the Immobilized 
protein and that makes possible many nonspecific Interactions. 

(V) The main advantage of the invention over protein purification via 
one or more affinity tags is the separation of the substoichiometrlcally 

25 Interacting proteins from the tagged protein or protein complex and, as a 
result, greatly facilitated analysis and identification. 

Different analytical methods can be used to Identity and analyze the 
proteins that are purified by the method of the invention, e.g. mass 
spectrometry, enzymatic assays, immunodetection. Immunodetection can be 

30 used to analyze the interacting proteins and/or to verify the identity of the 
Interacting proteins that cannot be identified unambiguously by mass 
spectrometry. When the proteins are eluted with ionic strength equivalent to 
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0.3M KCI or 0.4M KCI, they are not denatured and the eluate can be analyzed 
by biochemical assays for the presence of a particular enzymatic activity. 
Optionally, the eluate can be dialyzed against physiological buffer before the 
assay. 

After building a proteomic map around the protein of Interest (i.e. 
identifying unambiguously the interacting proteins), more sensitive methods 
for detecting the interacting protein can be used. For example, antibodies can 
be raised against particular interacting protein and it can be detected by 
immunodetection. However, one of the main utilities of the invention is the 
direct identification (preferably by mass spectrometry) of novel proteins that 
interact with a protein of interest in vivo. 

After separating the interacting proteins in the liquid phase, they can be 
further fractionated by SDS-PAGE (the length of the gel depends on the 
complexity of the eluate) and localized by silver staining or Coomassie 
15 staining. Another variant is to run a native gel and separate different 
permanent complexes from one another. 

The method of the invention is useful for detecting protein interactions 
in organisms where genetic approaches are impossible or very hard (e.g. 
Homo sapiens). The invention can also be used to elucidate the complete set 
20 of interactions that involve pjroteins having associations with human diseases 
and will reveal new targets for therapeutic intervention. Such diseases 
include, but are not limited to, Norrie disease; Alzheimer's disease; 
Parkinson's disease; beta3-adrenergic receptor gene mutation; 
achondroplasia; sickle cell anemia; thrombosis; alpha 1 -antitrypsin deficiency; 
25 and the diseases listed in table 1 . 

Classic features of Norrie disease include specific ocular symptoms 
(pseudotumor of the retina, retinal hyperplasia, hypoplasia and necrosis of the 
inner layer of the retina, cataracts, phthisis buibi). progressive sensorineural 
hearing loss, and mental disturbance, although less than one-half of patients 
30 are hearing impaired or mentally retarded. The Norrie gene encodes for a 
protein of 133 amino acids that has homologies at the C-terminus to a group 
of proteins including mucins. The high proportion of new mutations is an 
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expected finding for an X-Iinked disorder witli greatly reduced male 
reproductive fitness (Berger et al.. 1992). It has been suggested that the 
norrin protein regulates vascularization of the cochlea and retina (Rehm et al. 
2002). 



5 Alzheimer's disease (AD) is characterized neuropathologically by 

progressive deposition of the 40-42 residue amyloid p-Protein (AP) in brain 
parenchyma and cerebral blood vessels. There are two basic types of 
Alzheimer's disease; the more common form, known as late-onset, strikes 
people, who are usually over the age of 65. This form of Alzheimer's is 

10 thought to be due to both genetic and environmental causes. The other form 
called autosomal dominant, early-onset or familial Alzheimer's, strikes people 
who are under the age of 65, and is strictly due to genetic abnormalities. This 
second form accounts for less than 10 percent of all Alzheimer's cases. Yet in 
these cases most of the major break through in discovering the chromosomal 

1 5 mutations that contribute to Alzheimer's disease are being discovered. Known 
mutations are listed in table 2. 

There are similarities between Parkinson's disease and some other 
neurodegenerative disorders, especially Alzheimer's disease. For example, 
the production of Lewy bodies in Parkinson's disease is paralleled by the 
20 similar production of senile plaques in Alzheimer's disease (and to some 
extent in normal aging). The protein that is altered in familial Parkinson's 
disease has similarities to the non-beta amyloid component of Alzheimer's 
disease. Known mutations for Pari<inson's disease include C418R; E46K; 
N279K and R98Q. 

25 The Beta3-adrenergic receptor gene makes a protein in fat cells that is 

involved in determining how much fuel your body burns when you are resting. 
A mutation in this gene slows down how quickly a person bums fat and thus 
increases their tendency to be obese. One specific mutation in this gene, 
called TRP64ARG, is almost four times more common in Pima Indians than in 

30 people of European descent, and is one and a half times more common in 
people of African or Mexican descent. The prevalence of the TRP64ARG 
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gene mutation In these populations probably accounts at least In part for why 
these ethnic groups have a higher rate of Type 2 diabetes. 

Achondroplasia, the nfiost common genetic form of dwarfism, Is an 
autosomal dominant disorder whose underlying mechanism Is a defect In the 
5 maturation of the cartilage growth plate of long bones. Achondroplasia has 
recently been shown to result from a Gly to Arg substitution In the 
transmembrane domain of the fibroblast growth factor receptor 3 (FGFR3). 
although the molecular consequences of this mutation have not been 
investigated. By substituting the transmembrane domain of the Neu receptor 

10 tyrosine kinase with the transmembrane domains of wild-type and mutant 
FGFR3. the Arg380 mutation In FGFR3 Is shown to activate both the kinase 
and transforming activities of this chimeric receptor. Residues with side chains 
capable of participating in hydrogen bond fomiation. including Glu, Asp, and 
to a lesser extent. Gin. His and Lys, were able to substitute for the activating 

15 Arg380 mutation. The Arg380 point mutation also causes ligand-independent 
stimulation of the tyrosine kinase activity of FGFR3 Itself, and greatly 
Increased constitutive levels of phosphotyrosine on the receptor. These 
results suggest that the molecular basis of achondroplasia is unregulated 
signal transduction through FGFR3. which may result In inappropriate 

20 cartilage growth plate differentiation and thus abnormal long bone 
development. Achondroplasia may be one of the number of cogenltal 
disorders where constitutive activation of a member of the FGFR family leads 
to development abnomnalities. 

Sickle cell anemia is one of the first illness to be understood at the 
molecular level. In this genetic disease, Glu is replaced by a Val in the beta 
giobin chain. A single base substitution brings about this change. This is a 
nasty substitution because it replaces an acidic amino acid with a hydrophobic 
sidechain. The Val fits nicely into a hydrophobic pocket In another beta giobin 
subunit causing a polymerization between beta chains. This creates a kind of 
30 domino effect leading to the formation of long fibers made of many millions of 
damaged hemoglobulln molecules. The Hb polymers cause the cells to distort 
into a sickle shape. The mutation is lethal for homozygotes but the mutation 
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persists in populations with a high incidence of malaria. This Is generally 
thought to be because heterozygotes have a higher resistance to malaria. The 
malaria parasite spends some of its life in the red blood cell and these sickle 
cells are not a good host. 

5 Thrombosis is a serious condition. I<illing hundreds of thousands and 

debilitating millions each year by myocardial infraction, pulmonary embolism, 
or stroke. Inherited thrombophilia is a genetically determined tendency to 
venous thromboembolism. The temi reflects the presence of an inherited 
factor that, per se. predisposes towards thrombosis, but which, because of the 
10 episodic nature of thrombosis, requires interaction with other factors (inherited 
or acquired) before onset of the clinical disorder. Mutations in genes encoding 
proteins that activate coagulation pathways or inactivate anticoagulation 
mechanisms play an important role in the predisposition to venous 
thrombosis. Blood coagulation is downregulated by the protein C 
15 anticoagulant system. Protein C is activated on endothelial cells by thrombin 
bound to thrombomodulin. Together with its cofactor, protein S. activated 
protein C degrades the activated forms of factor V and VIII. Resistance to 
activated protein C is considered the most prevalent inherited cause of 
venous thrombosis. The most common cause of this resistance is a point 
20 mutation in the factor V gene (Factor V Leiden) that replaces an arginine 
residue at position 506 with a glutamine. R506Q (G to A transition at position 
1691). This change causes the mutant factor Va to resist proteolysis by 
activated protein C. The frequency of the Leiden genetic variation is relatively 
high in Caucasian populations (up to 6%) but it is much lower in African and 
Asian populations (down to 0%). Prothrombin (factor II) is the precursor of 
thrombin, the final effector of the clotting cascade that leads to the fomiation 
of fibrin. Prothrombin potentiates coagulation by positive feedback loops and 
also promotes anticoagulation by the protein C pathway. A G to A mutation at 
nucleotide 20210 which is the last residue at the 3'-untranslated region of the 
30 prothrombin gene has been found to be associated with higher plasma 
concentrations of prothrombin and a 2.8-fold increased risk of venous 
thrombosis. The prothrombin 20210 G to A mutation carrier frequency is 
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hlghest among whites of Southern European origin (3%), slightly lower among 
whites of Northern European origin (1.7%), uncommon among persons of 
Africa (0.7%) and very uncommon among persons of Asian origin (0%). 

Hyperhomocystlnaemia has also been shown to be a risk factor for 
5 venous thrombosis among other conditions. The most common genetic cause 
of hyperhomocystlnaemia is an amino acid substitution in 
methylenetetrahydrofolate reductase (MTHFR). the enzyme that converts 
5,10-methylenetetrahydrofolate to 5-methyitetrahydrofolate. the methyl donor 
in the conversion of homocysteine to methionine. A point mutation which 

10 changes a C to T at nucleotide 677 results in the substitution of valine to 
alanine at position 222 (C677T). The mutant enzyme exhibits reduced activity. 
Homozygous mutation in MTHFR is associated with a 3-fold increase in risk 
for premature cardiovascular disease. The frequency of this genetic variant is 
relatively high in French Canadian. European, Middle Eastem and Japanese 

15 populations (12-15%), lower in Finnish (5.4%) and Dutch (5.2%) populations, 
and very low In African-American populations (1 .4%). 

By far the most prevalent type of clinically important alpha 1 -antitrypsin 
(AAT) deficiency is classified as phenotype PI Z. AAT deficiency is the most 
prevalent potentially lethal hereditary disease of Caucasians and was 

20 discovered by Laurell and Eriksson in 1963. One of the amino acid 
substitutions (GIU342 ^ Lys342) may result in misfolding of the AAT, leading 
to intracellular accumulation and intracellular degradation of the abnormal 
protein (SIfers.R.N., Flnegold,M.J., and Woo,S.L.C. 1989. Alpha-1 -antitrypsin 
deficiency: Accumulation or degradation of mutant variants within the hepatic 

25 endoplasmic reticulum. Am.J.Respir.Cell Mol.Biol. 1:341-345). The structural 
alteration in the Z variant appears to allow "loop-sheet polymerization" of the 
molecule, during which the reactive center loops of one molecule become 
inserted into an opening in the A sheet of another molecule (Lomas,D.A., 
Evans.D.L, Finch,J.T.. and Carrell.R.W. 1992. The mechanism of Z alpha- 

30 antitrypsin accumulation in the liver. Nature 357:605-607, Lomas.D.A., 
Evans.D.L, Stone,S.R., Chang, W.-S.W., and Carrell.R.W. 1993. Effect of the 
Z mutation on the physical and inhibitory properties of a1 -antitrypsin. 
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Biochemistry 32:500-508). The S variant: The S variant has a single amino 
acid substitution (Glu264 ^ Val264) when compared with the most prevalent 
normal type of AAT. The S mutation is not associated with Intracellular 
accumulation of the protein, and the S protein inhibits elastase normally. The 
5 amounts of the S protein that reach the circulation are slightly lower than 
nomial, because of intracellular degradation of the AAT before it is secreted 
(Curiel.D.T., Chytil,A.. Courtney.M.. and Crystal.R.G. 1989. Serum a1- 
antitrypsin deficiency associated with the common S-type (Glu-Val) mutation 
results from intracellular degradation of a1-antitrypsin prior to secretion. 

10 J.BioLChem. 264:10477-10486). The S allele is slightly more prevalent than 
the Z allele among U.S. Caucasians, and it is much more prevalent in the 
Iberian peninsula and neighboring countries. Individuals with the Pi S 
phenotype do not appear to be at increased risk for lung or liver 

Overo lethal white syndrome (OLWS) is an inherited syndrome of foals 

15 born to American Paint Horse parents of the overo coat-pattern lineage. 
Affected foals are totally or almost totally white and die within days from 
complications due to intestinal aganglionosis. Related conditions occur in 
humans and rodents in which mutations in the endothelin receptor B (EDNRB) 
gene are responsible. EDNRB is known to be involved in the developmental 

20 regulation of neural crest cells that become enteric ganglia and melanocytes. 
There is a polymorphism in the equine EDNRB gene closely associated with 
OLWS, This lie to Lys substitution at codon 1 1 8 is located within the first 
transmembrane domain of this seven-transmembrane domain G-protein- 
coupled receptor protein. 

25 The method of separating and optionally analyzing ligands that form 

transient complexes with other ligands can be used in drug or pre-drug 
screening assays. For example, the eluate obtained from the method of the 
invention described above, can be analyzed for the presence of substances 
that can be drugs or pre-drugs. In addition, a drug library can be applied to the 

30 living cells and/or the cellular lysate, and then the eluate can be analyzed for 
the presence of one or more members of the library. Also, after the elution of 
the interacting proteins, the immobilized protein or protein complex can be 
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used for screening libraries of chemicals for a particular chemical that binds to 
it. Methods for drug discovery are described by Rossi D. and Sinz M., Marcel 
Dekker 2001 "Mass Spectrometry in Drug Discovery". 

Moreover, the invention can be used as a novel method for drug design 
5 and drug discovery. A chemical can be identified as a drug or a pre-drug by its 
capability to affect the formation and/or disintegration of a particular protein 
complex that is associated with a disease. Such a chemical should have two 
characteristic features: (a) it should affect selectively a particular protein 
interaction (or group of interactions involving a protein that is associated with 

10 a disease), unlike increasing the ionic strength which affects transient protein 
Interactions nonselectively. and (b) it should affect the particular protein 
interaction at a concentration below 10 mM, i.e. a concentration that does not 
change significantly the ionic strength. After the chemical or a biomolecule Is 
identified as a pre-drug. it can be further modified and the experiment can be 

15 repeated with lower concentrations of the said chemical or biomolecule. This 
can be used to obtain such a chemical that affects selectively the formation 
and/or disintegration of the protein complex at micromolar concentration. I.e. 
concentration that Is appropriate for treatment of human subjects. 

The effect of the drug or pre-drug can be determined by monitoring the 

20 presence and/or the concentration of the interacting proteins in the eluate. 
Preferably, after building a proteomic map around the protein of interest (i.e. 
identifying unambiguously the interacting proteins), other methods, that are 
more sensitive than mass spectrometry, can be used. For example, 
antibodies can be raised against a particular protein, and then it can be 

25 detected by immunoassays. Also, the interacting proteins can be tagged and 
detected by antibodies against the tag. 

The effect of the pre-drug on a particular protein interaction can be 
tested in several ways: 

(i) The pre-drug can be added to the growing cells, the lysis buffer and 
30 the washing buffer. In this case, the elution is performed with washing 

buffer without a pre-drug. Altematively. the elution is performed with 
washing buffer with increased ionic strength but without a pre-drug. 
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(ii) When the chemical is applied to the immobilized protein complex 
after the removal of unbound substances, it can be used as a single 
elution agent or together with one or more nonspecific elution agents, 
i.e. the elution can be done with the chemical in the washing buffer or 
5 with the chemical In the elution buffer. 

(ill) The high salt eluate, optionally dialyzed against washing buffer or 
other physiological buffer, can be recombined with the immobilized 
fusion protein and the pre-drug and incubated for 10-60 minutes or 
more, depending on the concentration of the proteins and the 

10 equilibrium binding constant of the protein interaction. The restoration 

of the protein complex can be detemiined by monitoring the amount of 
one or more interacting proteins in the eluate and/or on the affinity 
matrix (i.e. after the incubation and the removal of unbound proteins, 
the elution is performed with high salt buffer). 

15 A method for drug design and drug discovery that is based on the 

finding that the nature of transient protein-protein interactions is dominantly 
electrostatic contains the following steps: 

(i) Selecting a protein or a protein complex associated with a disease 
and obtaining a cell line or organism with an affinity tagged gene(s). 

20 According to our finding, a disease that is caused by a mutation that 

changes the electrostatic properties of a protein is due to an aben-ation 
In a transient protein interaction. The mutation can change the 
electrostatic properties of the protein by replacing Lysine and/or 
Arginine and/or Aspartate and/or Glutamate and/or Histidine with an 

25 uncharged or oppositely charged amino acid. In addition, a mutation 

can change the electrostatic properties of the protein by replacing other 
amino acids with Lysine and/or Arginine and/or Aspartate and/or 
Glutamate and/or Histidine. See Table 1 and Appendix 1 , and note how 
many severe diseases are caused by such mutations. 

30 (ii) Performing the method of the invention and/or other affinity 

purifications in order to sort out which group of proteins are associated 
in stable protein complexes (i.e. associated mainly by hydrophobic 
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15 



20 



30 



forces) and which ones are associated in transient complexes (I.e. 
associated mainly by electrostatic forces). Building a proteomic map 
(i.e. map of protein-protein Interactions) around the protein that is 
associated with the 



5 (ill) Performing the method of the invention under different conditions 

(functional proteomics) and/or with cell lines originating from 
organism(s) with a disorder. Detennining which protein Interaction is 
associated with the specific disease and determining the structure and 
the composition of the interface (the part of a protein surface that is 
10 interacting with the other protein). 

(iv) Designing and/or synthesizing a chemical or a group of chemicals 
capable of binding to the mutant protein and restoring, at least partially, 
Its electrostatic properties. Alternatively, the drug may be designed to 
bind to the native protein interactor and modify its electrostatic 
properties in such a way that it can interact with the mutant protein. An 
already existing drug or chemical can be used in the beginning and 
later can be modified. The drug should consist of two parts - (I) Part 
that binds selectively to the protein and ensures the specificity of the 
drug. Methods for designing and/or synthesizing chemicals that bind 
selectively to particular protein structure (most often by shape 
complementarity) are described in the literature and known to persons 
skilled in the art - Pong S., Blopharmaceutical Drug Design and 
Development, Blackwell Publishing 1999, (II) Part that is 
electrostatically charged and restores, at least partially, the 
25 electrostatic properties of the protein. For example, if a mutation is 

replacing a positively charged amino acid (Lysine or Arglnine) with 
uncharged one, the drug should contain a positively charged group and 
after the binding of the drug to the protein, such group should be 
located at the same place as the mutated amino acid or close to it. 
Designing a drug in such a way that it binds selectively to a mutant 
protein and brings in an electrostatic charge that modifies the 
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electrostatic properties of the mutant protein is a novel approach for 
drug design. 

(v) Testing the capability of the chemical (pre-dnjg) to affect the proper 
formation and/or disintegration of the protein complex that is 
5 associated with a disease. 



The following non-limiting examples are illustrative of the present 
invention: 



10 EXAMPLES 

Example 1: Robl-SpA-taq fusion 

Heterologous DNA encoding a selectable marker and a SpA-tag fused 
in-frame to the carboxy-terminus of Rpb1 was constructed and administered 
into Saccharomyces cerevisiae cells. The resulting strain expresses a fusion 

15 protein containing Rpbl and a SpA-tag containing four IgG-binding domains. 
After selecting the transformants, the proper expression of the fusion protein 
was checked by Western blotting. Cell cultures were grown in YPD medium at 
SO^C and constant vigorous shaking and cells were collected at OD600 = 1 .2. 
The following protocol was used: 

20 1. Cells were collected by centrifugation at 4,000 rpm for 4 minutes 

and the medium was decanted. All subsequent manipulations were 
perfomned on ice or at 0°C to 5*'C. 

2. Cells were suspended in 200 ml water and distributed into 50ml 
plastic tubes; centrifiiged at 4,000 rpm for 4 minutes and the water 

25 was decanted. 

3. Frozen in liquid nitrogen for 1 minute. 

4. The solid pellet was broken, and protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry ice were added; ground with a coffee grinder 
for 1 minute; the powder was transferred to a beaker and, after 

30 allowing it to melt, the same volume of 2x Lysis buffer was added. 

and the lysate was stirred with a plastic rod for 2 minutes. 2x Lysis 
Buffer: 200mM KCI, 50mM HEPES pH 7.4, 20% glycerol, 2% 
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DMSO, 0.2% TritonXlOO, 2mM PMSF, 2mM benzamidine, 2x 
phosphatase inhibitors. 

6. Centrifuged at 16,000 rpm for 30 minutes and the clarified lysate 
was transferred to a clean tube. 

5 6. 800 microliters of IgG beads was added and the tube was rotated 

for 2 hours. 

7. Centrifuged at 3,000 rpm for 2 minutes and the supernatant was 
decanted. 

8. The IgG beads were transfered and distributed to four parallel 1 0ml 
10 chromatography columns. Each column was washed with 10ml 

washing buffer (WB) by applying 200 microliter aliquots. WB: 
lOOmM KCI, lOmM HEPES pH 7.4. 5% glycerol, 1% DMSO, 0.1% 
TritonXlOO. 

9. The beads were transferred to a 5ml tube and 3ml of WB containing 
15 0.4M KCI (WB (0.4M)) was added. 

10. The tube was rotated for 10 minutes. 

11. The beads and the buffer were transferred to a 10ml 
chromatography column and the eluate was collected by gravity 
flow. The beads were washed with two column volumes of WB 

20 (0.4M) and the eluates were combined. 

12. The eluate was distributed to eppendorf tubes and 1/5 volume of 
100% TCA was added. 

13. Incubated on ice for 1 hour; centrifuged for 1 hour at 14,000 rpm at 
O^C to 5°C. 

25 14. The supernatant was decanted and 1 ml 90% acetone was added. 

Vortexed for 30 seconds and centrifuged for 10 minutes at 14,000 
rpm. The supernatant was decanted. 

15. Dried with a SpeedVac for 15 seconds. 

16. Protein gel electrophoresis (SDS-PAGE - 10% gel, 20 cm long) and 
30 silver staining were performed. 

A gel image of high salt eluate from immobilized RNA Polymerase II 
from Saccharomyces cerevisiae is shown in Figure 3. The digits and arrows at 
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the right side of the gel indicate the identified proteins. Legend: 31 - Spt6. 32 
- Rgri, 33 - Spt6 + Spt5. 34- Tfgl + Spt5, 35- Set3, 36 - Cetl + Medl, 37 - 
Pob3, 38 - Npl3, 39 - Tfg2 + iwsl , 40 - Cdc73, 41 - TFIIS. 42 - Essi 

Protein identifications by mass fingerprinting were made essentially as 
5 described in Shevchenko A et.al. Anal. Chem.. 1996. and Van Montfort, B.A. 
et.al.. J. Mass Spectrom., 2002. After the SDS-PAGE and silver staining the 
gel lanes were cut across the whole length (including the bands stained most 
weakly with silver and "blank" regions) and the proteins in the gel pieces were 
reduced by DTT treatment and alkylated by iodoacetamlde treatment. After 

10 trypsin digestion, the resulting peptides were purified and concentrated by 
reverse phase chromatography on CIS-matrix. The peptides were co- 
crystallized with CHCA (a-cyano-4-hydroxycinnamic acid) ionization matrix 
and subjected to MALDI-TOF MS. After obtaining the MALDI-TOF MS 
spectra, the proteins were identified by matching the list of experimentally 

1 5 obtained peptide masses with computer-generated lists of peptide masses for 
every predicted protein in a Saccharomyces cemvisiae databanks. When 
contaminant proteins were identified in the sample, the corresponding peptide 
masses were subtracted and the search was performed again with the 
remaining masses. When a band is not indicated by an arrow, it means that it 

20 contains alternative translation products of proteins that are indicated 
elsewhere or their degradation products or contaminant proteins. The proteins 
identified with high confidence are listed in the text (the legends) and 
indicated in the Figures. 

A gel image of high salt eluate from immobilized RNA Polymerase II 
25 from Saccharomyces cerevisiae is shown in Figure 4. The fusion protein 
contains Rpbl and a SpA tag containing two IgG binding domains. 18 liters 
culture was grown to OD600 = 0.7 and the separation of the interacting 
proteins was perfonned with washing buffer containing 0.4 M KCI (WB(0.4M)). 
The following protocol was used: 

30 1. The cells were collected by centrifugatlon at 4.000 rpm for 4 minutes 

and the medium was decanted. All subsequent manipulations were 
perfonned on ice or at 0**C to 5°C. 
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2. The cells were suspended in 200 ml water and distributed into 50ml 
plastic tubes; centrifuged at 4,000 rpm for 4 minutes and the water was 
decanted. 

3. Frozen in liquid nitrogen for 1 minute. 

5 4. The solid pellet was brol<en, and protease inhibitors and 2-3 pieces 

(around 8 cm^) of dry ice were added; ground with a coffee grinder for 
1 minute; the powder was transferred to a beaker and, allowed to melt, 
the same volume 2x Lysis buffer was added. Stirred with a plastic rod 
for 2 minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES pH 7.4, 20% 
10 glycerol, 2% DMSO, 0.2% TritonXlOO, 2mM PMSF, 2mM benzamidine, 

2x phosphatase inhibitors. 

5. Centrifuged at 16,000 rpm for 30 minutes and transferring the clarified 
lysate to a clean tube. 

6. 800 microliters of IgG beads were added and the tube was rotated for 2 
15 hours. 

7. Centrifuged at 3,000 rpm for 2 minutes and the supernatant was 
decanted. 

8. The IgG beads were transferred and distributed to four parallel 10ml 
chromatography columns. Each column was washed with 10ml 

20 washing buffer (WB) by applying 200 microliter aliquots. WB: lOOmM 

KCI, 10mM HEPES pH 7.4, 5% glycerol, 1% DMSO, 0.1% TritonXlOO. 

9. The beads were transferred to a 5ml tube and 3ml WB was added 
containing 0.4M KCI (WB (0.4M)). 

10. The tube was rotated for 10 minutes. 

25 11. The beads and the buffer were transferred to a 1 0ml chromatography 

column and the eluate was collected by gravity flow. The beads were 
washed with two column volumes of WB (0.4M) and the eluates were 
combined. 

12. The eluate was distributed to eppendorf tubes and 1/5 volume of 100% 
30 TCA was added. 

13. Incubated on ice for 1 hour; centrifugation for 1 hour - 14,000 rpm at 
0°C to 5*'C. 
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14. The supernatant was decanted and 1 ml 90% acetone was added. 
Vortexed for 30 seconds and centrifuged for 10 minutes at 14,000 rpm. 
The supernatant was decanted. 

15. Dried with a SpeedVacfor 15 seconds. 

5 16. Protein gel electrophoresis (SDS-PAGE - 10% gel, 20 cm long) and 

silver staining were performed. 
Legend: 51 - Spt5 + Spt6, 52 - Spt5 + Spt6. 53 - Spt5 + Spt6. 54 - Ctr9 + 
Spt5 + Spte, 55 - Spt5 + Ydl145 (Copi) + Spt6, 56 - Tfg1 + Spt5 + Spt6 + 
Spt16. 57 - Spt5 + Tfgl + Sec21, 58 - Spt5 + Tfgl, 59 - Set2 + Spt5 + Rtf1 + 

10 Ydl145 (Cop1), 60 - Cetl. 61 - Tfg2 + Iwsl. 62 - Ceg1. 63 - Cdc73. 64 - 
TFIIS. 65 - Rtt103, 66 - Tfg3, 67 - Yhr121w, 68 - Essi . 
Example 2: Nons oeciffic binding to laG 

Besides the separation of the interacting proteins and other 
biomolecules from the immobilized fusion protein, increasing the ionic 

15 strength causes separation of residual contaminant proteins from the affinity 
matrix. One way to avoid false signals (i.e. eliminate the background from the 
contaminants) is to perfomi the method of the Invention with cellular lysate 
from an organism devoid of affinity tagged protein and identify the proteins in 
the eluate as contaminants. Alternatively, the proteins that are isolated by 

20 performing the method of the invention with several biologically unrelated 
proteins can be compared and the common ones can be identified as 
contaminants. The contaminants are not indicated In the Examples. 

An example of identifying proteins that bind nonspecifically to IgG 
beads and/or to other proteins (contaminants) is shown in Figure 5. Protein 

25 extract firom Saccharomyces cerevisiae lacking an affinity tagged protein was 
incubated with IgG beads, the unbound proteins were removed and proteins 
that bind to IgG-sepharose and/or to other proteins nonspecifically were 
eluted with high salt in the washing buffer. The same protocol as in Figure 4 
was used. Legend: 70 - Acc1+ Fasi + Fas2, 71 - Lys2 + Ydr098, 72 - Mdn1 

30 + Tra1 , 73 - Sec27 + Ade3, 74 - Prti + Lys2. 75 - Ssel + Lsg1 , 76 - Ssbl + 
Ssa2 + Pabi + Dedl, 77 - Hsp60 + Dhh1. 78 - Gcdil + Ded1 + Nop12 + 
Nog2. 79 - Tef2 + Phrl, 80 - Tef2 + L3, 81 - Nop3 + Tef2, 82 - Yor060 + 
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Ydjl . 83 - Stml + Rpl4. 84 - Tif34, 85 - RppO + Rpl5, 86 - Tef2 + Nop1 , 87 - 
Rps1 + Rpl2, 88 - Rps3 + Rpl8. 89 - Rps2, 90 - Rps5. 91 - Rps8 + Rpl9, 92 

- RpMO. 93 - Rps20. 94 - Rpl20, 95 - Rps14. 96 - Rps17. 97 - Rpl27 + 
Rps24. The following contaminants are not indicated in the other figures: 

5 Phrl . Tef2, Nop1 . Acc1 . Nop3. Fks1 , YorOSO. Fas2. Ydj1 , Fun1 2. Stm1 , Aro1 , 
Rpl4. Lys2. Tif34. Ydr098, RppO, Rpg1. Rpl5. Rps3, Ade3, Rps1b, Sec27, 
Rpl2. Prt1. Rpl8, Tef2, Rps2, Lsg1, Rps5, Dhh1. RpMO, Ded1, Rpl9. Sszl, 
Hsp60. Rps8, Gcd11. Rps20, Nop12, Rpl20, Nog2, Pab1. Rps17, Fas1, Ssbl, 
Ssa2, Sse1. Rpl27. Rps14, Rps24, Hfa1, Zuo1. Asc1, Mdn1, Tra1, Dynl. 
10 Myo2, Ira2. 

Example 3; Higher salt eluates 

After Identifying the contaminants for a particular organism, the 
possibility to assign a contaminant protein as a true interactor, i.e. to produce 
a false positive interactor, are minimal and the experiments can be performed 

15 under less stringent conditions. For example, after immobilizing the 
complexes on the affinity matrix, the removal of the unbound proteins can be 
very brief (i.e. washing 200mcl IgG beads with 6 ml washing buffer) and the 
elution of the Interacting proteins can be performed by a higher salt 
concentration (i.e. 0.7 M potassium acetate). High salt eluates from 

20 immobilized RNA Polymerase II complexes from Sacchammyces cemvisiae 
obtained in three different experiments from three different strains each 
expressing different SpA-tagged subunit are shown in the following Figures. 
SpA-tags contain two IgG binding domains. The fusion protein in Figure 6 
contains Rpbl, the fusion protein in Figure 7 contains Rpb3, the fusion protein 

25 in Figure 8 contains Rpb9. The eluates were resolved by SDS-PAGE and the 
proteins were identified by MALDI-TOF MS 

Legend for Figure 6: 120 - Spt6, 121 - Spt6, 122 - Spt6 + Cyr1 + Clu1, 
123 - Tfgl + Nam7 + Spt16, 124 - Fcpl + Cdc48 + Sec21, 125 - Taf5 + 
Top2, 126 - Cetl + Pcfl + Prp6, 127 - Pob3, 128 - Rtt103 + Ptc3 + Has1, 

30 129-Tfg2 + lws1 +Pap1, 1 30 - Tfg2 + Iwrl ; 131-Rail, 132-Sua7, 133- 
TFIIS, 134 - Ypl253, 135 - Ssn3 + Mrt4, 136 - Tfg3 + Rcl, 137 - Med7, 138 

- Yhr121 w, 139 - Mbfl + Taf9 + Yor379c, 140 - Ybr262c + Ybr174c. 
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Legend for Figure 7: 150 - Spt6. 151 - Spt5 + Ctr9 + New1, 152 - 
Spt16 + Tor2, 153 - Tfgl + Hpr5 + Nam7, 154 - Fcpl + Cdc48 + Nam?, 155 
- Fcpl + Rad3, 156 - Fcpl + Taf5 + Pbpl. 157 - Cet1 + Cyrl. 158 - PobS. 
159 - RttlOS + PtcS. 160 - TFIIH + Rad23 + Drsi, 161 - Tfg2 + Iwsl. 162 - 
6 Rail + Pob3 + Rgri + Ygr086c. 163 - Funll + Subl, 164 - TFIIB, 165 - 
TFIIS + YOI045, 166 - Tfg3. 167 - Din7. 168 - Yhr121w. 169 - Nhp2, 170 - 
Mbfl . 

Legend for Figure 8: 180 - Spt6 + Stt4 + Ydl145 (Cop1). 181 - Spt5 + 
Newl +Tof1. 182-Spt5 + Rad3. 183 - Tfgl + Cdc48 + Rgri + Nam7. 184- 

10 Fcpl + Cdc48 + Rad1 + Sec21. 185 - Fcpl + Sup35 + Rap1, 186 - Cetl + 
Rapl + Cdc6, 187 - Tfg2 + Iws1. 188 - Cegl + Kini + Prp8, 189 - Cegl + 
Rgml, 190 - Ygr086c + Ydr065w. 191 - TFIIB, 192 - TFIIS + Thol + BIml, 
193 - Tfg3 + Tfa2 (TFIIE). 194 -Set3 + Tfb2 (TFIIH) + Yol022c. Cdc6, Prp2.' 
Prp16, Rifl and Cfti were also detected. 

15 Example 4: Additional purification step to reduce false positives 

Yet another way to reduce the possibility of producing false positives is 
to reduce the background from the contaminant proteins by performing an 
additional purification step before the separation of the interacting proteins 
from the fusion protein. Preferably, the additional step is an affinity purification 

20 step and in this case, two different affinity tags can be fused to the protein of 
interest (i.e. protein is fused to a dual tag) or to two subunits of a protein 
complex, and, after the second immobilization, the interacting proteins are 
separated from the immobilized fusion protein. The tag used in the following 
experiments is a modified version of the TAP-tag created by Ghaemmaghami, 

25 S., et.al. Nature, 2003. 

A gel image of high salt eluate from immobilized TFIIF from 
Saccharomyces cerevisiae is shown in Figure 9. The carboxy-termlnus of the 
chromosomal copy of the gene encoding Tfg1 is fused in-frame with a TAP- 
tag. 18 liters Saccharomyces cerevisiae culture was grown In YPD medium to 
30 OD600 = 0.7. The complex was purified on IgG beads and further Immobilized 
on calmodulin beads. The elutlon was performed with 0.3 M KCI in the buffer. 
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Legend: 201 - Rpb1. 202 - Rpb1+Rpb2, 203 - Fcp1 + Rpb2 + Rpb1. 
204 - Rpb1 + Rpb2 + Fcpl , 205 - Glo3, 206 - Rpb3. 207 - TFIIS, 208 - Mpd2, 
209 - Rpb4, 210 - Yhr121w + Rpb5, 211 - Rpb6. 212 - Rpb7, 213 - Mbfl + 
Rpb8, 214 - Rpb9. 
5 The following protocol was used: 

1 . The cells were collected by centrlfugation at 4.000 rpm for 4 minutes 
and the medium was decanted. All subsequent manipulations were 
perfomied on ice or at O^C to S^C. 

2. The cells were suspended in 200 ml water and distributed into 50ml 
10 plastic tubes; centrlfuged at 4.000 rpm for 4 minutes and the water was 

decanted. 

3. Frozen in liquid nitrogen for 1 minute. 

4. The solid pellet was broken and protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry ice were added; ground with a coffee grinder for 

16 1 minute; the powder was transferred to a beaker and, after allowing it 

to melt, the same volume of 2x Lysis buffer was added. Stirred with a 
plastic rod for 2 minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES 
pH 7.4. 20% glycerol. 2% DMSO. 0.2% TritonXI 00, 2mM PMSF, 2mM 
benzamidine, 2x phosphatase inhibitors. 

20 5. Centrlfuged at 16.000 rpm for 30 minutes and the supernatant was 

transferred to a clean tube. 

6. 800 microliters of IgG beads was added and the tube was rotated for 2 
hours. 

7. Centrlfuged at 3,000 rpm for 2 minutes and the supernatant was 
25 decanted. 

8. The IgG beads were transferred and distributed to three parallel 10ml 
chromatography columns. Each column was washed with 10ml 
washing buffer (WB) by applying 200 microliter aliquots. WB: lOOmM 
KCI, 10mM HEPES pH 7.4, 5% glycerol, 1% DMSO, 0.1% TritonXI 00. 

30 9. Washed with 400 microliters WB containing 2mM DTT. 
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10. The beads were transferred with 2ml WB containing 2mM DTT to two 
eppendorf tubes and 500 units Tev protease was added. Rotated at 
0°C to 6°C for 2 hours. 

, 11. The beads and the buffer were transferred back to 10ml 
. 5 chromatography column and the eluate was collected. The beads were 

washed with two more column volumes of WB containing 2mM DTT 
and the eluates were combined in a 5 ml tube. 
12.500 microliters calmodulin beads and 5mM CaCb and 3 ml WBC 
(washing buffer for calmodulin beads were added. WBC - 100 mM 
10 KCI, 10 mM HEPES pH 7.4, 5 mM DTT, 6 mM CaCb. 1 mM Immidazol, 

5% glycerol, 0.1% TritonXIOO. Rotated for 1 hour at 0°C to 5°C. 

13. The beads were transferred to 10ml chromatography column and 
washed with 5ml WBC. 

14. The beads were transferred to a 5ml tube and 3ml WBC containing 
15 0.3M KCI (WBC(0.3M)) was added. The tube was rotated for 10 

minutes. 

15. The beads and the buffer were transferred to a 10ml chromatography 
column and the eluate was collected by gravity flow. The beads were 
washed with 2 column volumes of WBC (0.3M) and the eluates were 

20 combined. 

16. The eluate was distributed to eppendorf tubes and 1/5 volume of 100% 
TCA was added. 

17. Incubated on ice for 1 hour; centrifuged for 1 hour -- 14,000 rpm at 0°C. 

18. The supernatant was decanted and 1 ml 90% acetone was added. 
25 Vortexed for 30 seconds and centrifuged for 10 minutes at 14,000 rpm. 

The supernatant was decanted. 

19. Dried with a SpeedVac for 15 seconds. 

20. Protein gel electrophoresis (SDS-PAGE - 10% gel, 20 cm long) and 
silver staining were performed. 

30 A gel image of high salt eluate from immobilized RNA Polymerase II 

from Saccharomyces cerevisiae is shown in Figure 10. The carboxy-terminus 
of the chromosomal copy of the gene encoding Rpbl is fused in-frame with a 
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TAP-tag. 18 liters Saccharomyces cerevisiae culture was grown In YPD 
medium to OD600 = 0.7. The complexes were purified on IgG beads and 
further Immobilized on calmodulin beads. The elution was perfonned with 0.5 
M KCI In the WBC. A minor release of the fusion protein from the afTinlty 
5 matrix was observed. 

Legend: 221 - Rpbl. 222 - Spt6, 223 - Spt6, 224 - Rpb2 + Ctr9. 225 - 
Ctr9 + Spte. 226 - Tfgl + Prp22 + Spt16 + Spt6, 227 - Tfgl + Fcp1 + Spt16 + 
NIpl. 228 - Fcp1 + Set2 + Rtfl + Ygl244, 229 - Tfg1 + Fcp1 + Ygr054 + Leo1 
+ Med1. 230 - Pob3, 231 - Paf1 + Glo3. 232 - Iwsl + Tfg2. 233 - TFg2, 234 - 
10 Sua7 (TFIIB homolog). 235 - TFIIS. 236 - Mpd2 . 237 - Tfg3. 238 - Essl. 239 
- Mbf1 . 

The following protocol was used: 

1. The cells were collected by centrlfugation at 4,000 rpm for 4 minutes 
and the medium was decanted. All subsequent manipulations were 

1 5 performed on Ice or at 0°C to 5°C. 

2. The cells were suspended in 200 ml water and distributed into 50ml 
plastic tubes; centrifuged at 4,000 rpm for 4 minutes and the water was 
decanted. 

3. Frozen in liquid nitrogen for 1 minute. 

20 4. The solid pellet was broken and protease Inhibitors and 2-3 pieces 

(around 8 cm3) dry ice were added; ground with a coffee grinder for 1 
minute; the powder was transfenred to a beaker and, after allowing it to 
melt, the same volume of 2x Lysis buffer was added. Stirred with a 
plastic rod for 2 minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES 

25 pH 7.4, 20% glycerol, 2% DMSO, 0.2% TritonXIOO. 2mM PMSF, 2mM 

benzamidine, 2x phosphatase inhibitors. 

5. Centrifuged at 16.000 rpm for 30 minutes and the supernatant was 
transferred to a clean tube. 

6. 800 microliters of IgG beads was added and the tube was rotated for 2 
30 hours. 

7. Centrifuged at 3.000 rpm for 2 minutes and the supernatant was 
decanted. 
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8. The IgG beads were transferred and distributed to three parallel 10ml 
chromatography columns. Each column was washed with 10ml 
washing buffer (WB) by applying 200 microliter allquots. WB: lOOmM 
KCI, 10mM HEPES pH 7.4, 5% glycerol, 1% DMSO, 0.1% TritonXIOO. 
5 9. Washed with 400 microliters WB containing 2mM DTT. 

10. The beads were transferred with 2ml WB containing 2mM DTT to two 
eppendorf tubes and 500 units Tev protease was added. Rotated at 
0°C to S^C for 2 hours. 

11. The beads were transfen^ed back to 10ml chromatography column and 
10 the eluates were collected, the beads were washed with two more 

column volumes of WB containing 2mM DTT and the eluates were 
combined in a 5 ml tube. 
12.500 microliters calmodulin beads and 5mM CaCb and 3 ml WBC were 
added. WBC: 100 mM KCI, 10 mM HEPES pH 7.4, 5 mM DTT. 5 mM 
15 CaCIa, 1 mM Immidazol, 5% glycerol, 0.1% TritonXIOO. Rotated for 

Ihr. The beads were transfenred to 10ml chromatography column and 
washed with 5ml WBC. 

13. The beads were transferred to a 5ml tube and 3ml WBC containing 
0.5M KCI (WBC (0.5)) was added. The tube was rotated for 10 

20 minutes. 

14. The beads and the buffer were transferred to a 10ml chromatography 
column and the eluate was collected by gravity flow. The beads were 
washed with two column volumes of WBC(0.5M) and the eluates were 
combined. 

25 1 5. The eluate was distributed to eppendorf tubes and 1/5 volume of 1 00% 

TCAwas added. 

16. Incubated on ice for 1 hour; centrifuged for 1 hour - 14,000 rpm at O^C. 

17. The supernatant was decanted and 1 ml 90% acetone was added. 
Vortexed for 30 seconds and centrifuged for 10 minutes at 14,000 rpm. 

30 The supematant was decanted. 

18. Dried with a SpeedVac for 15 seconds. 
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19. Protein gel electrophoresis (SDS-PAGE - 10% gel, 20 cm long) and 

silver staining were performed. 
A gel image of high salt eluate from immobilized RNA Polymerase II from 
Saccharomyces cemvisiae and another eluate containing the core complex is 
5 shown in Figure 11. The carboxy-terminus of the chromosomal copy of the 
gene encoding Rpbl is fused in-frame with a TAP-tag. 9 liters 
Saccharomyces cerevisiae culture was grown in YPD medium to OD600 = 
0.8. The protocol used for obtaining the high salt eluate shown in the right 
lane is the same as in Figure 10 except for the high salt elution that was 
10 performed with 0.3 M KCI in the washing buffer. After separating the 
interacting proteins by high salt elution and thus completing the method of the 
invention, the Immobilized RNA Polymerase 11 complexes were released by 
treatment with 5mM EGTA and 0.5% SDS in WB. Half of the eluate was 
loaded In the left lane and subjected to PAGE-SDS in order to demonstrate 
15 the ratio between the amounts of the interacting proteins and the immobilized 
core complex. All gel bands. Including the "blank" ones, were cut for analysis 
and the proteins that were identified with high confidence are indicated. Note 
the complete separation of the interacting proteins from the immobilized core 
complex by increasing the ionic strength (except TFIIF, which binds strongly 
to the core enzyme and was detected in both fractions). Different Intensities of 
the bands in the two gel lanes illustrate the need to separate the 
substolchiometrically interacting proteins from the immobilized core complex. 
Note that a double gel well was used to load only a half of the SDS/EGTA 
eluate In order to avoid overloading and smearing of the gel. 
25 Legend: 271 - Spt6. 272 - Ydl145 (Cop1), 273 - Ctr9 + Spt5. 274 - 

Tfgl + SptlS + Lte1, 275 - Tfgl + Spt5 + SptlB + Fcpl. 276 - Set2, 277 - 
Fcp1 + Leol, 278 - Pob3, 279 - Pafi, 280 - Pafi + Rtt103, 281 - Tfg2 + 
Iwsl. 282 - Tfg2. 283 - Cdc73 + Cegl, 284 - Dsti, 285 - Rtt103, 286 - Tfg3, 
287 - Essi, 288 - Rpbl. 289 - Rpb2, 290 - Tfg1, 291 - Tfg2. 292 - Rpb3, 
30 293 - Tfg3, 294 - Rpb4. 295 - Rpb5, 296 - Rpb6, 297 - Rpb7, 298 - Rpb8. ' 

A gel image of the high salt eluate from immobilized Rpb3 from 
Saccharomyces cerevisiae is shown In Figure 12. The carboxyl-terminus of 



20 
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the chromosomal copy of the gene encoding Rpb3 is fused In-frame with a 
TAP-tag. The protocol used for obtaining the high salt eluate is the same 
protocol used in Figure 10. Interacting proteins are indicated. In addition, trace 
amounts of the immobilized complex (Rpbl. Rpb2 and RpbS) are shown. 
6 The separation of the immobilized complex from the affinity matrix is 

not necessary for practicing the invention, it is outside the scope of the 
Invention and. when practicing the invention, the greatest care must be taken 
not to separate the immobilized fusion protein from the affinity matrix during 
the separation of the Interacting proteins from the immobilized fusion protein. 

10 The contaminants that were determined by perfonning the method of 

the invention with two Immobilizations on two different affinity matrices, i.e. 
IgG-sepharose and calmodulin beads, and with cellular lysate from organism 
devoid of affinity tagged protein, are: Tef2, Fks1, Fas2. Stml. Rpl4. Tlf34. 
Rpl5, RpsS, Rps1b. Rpl2, Prtl. Rpl8. Tef2, Rps2. Rps5. Rpl10. Ded1. Rpl9, 

15 Hsp60. RpsS. Gcd11. Rps20, Rpl20, Pab1. Rps17, Ssbl. Ssa2. Ssel. Rpl27, 
Rps14, Rps24. Mdnl, Tra1, Dyn1. Myo2, Ira2. 

Although the usage of dual affinity tag results in low background from 
the residual contaminant proteins, the number of the detected interacting 
proteins Is lower than the number of interacting proteins that are detected 

20 when a fusion protein with one tag (SpA-tag) Is used. It is clear that the basic 
method of the invention (i.e. the protein of interest Is fused to one affinity tag 
and only one immobilization of the protein of interest is performed before the 
separation of the interacting proteins) is faster, cheaper and results in 
Identifying more Interacting proteins. Compare the interacting proteins in 

25 Figure 3, 4, 6, 7 and 8 with Figure 9 and 10. Therefore, when the protein of 
Interest Is present in the cell in approximately 5,000 - 10.000 copies or higher, 
it is best if the basic method of the invention is carried out with a fusion protein 
containing one affinity tag. A good explanation for the loss of the interacting 
proteins when two sequential immobilization are performed Is given by R. 

30 Aebersold and M. Mann. Nature. March 2003. Vol. 422. The background from 
the contaminant proteins creates problems mostly for the identification and 
analysis of the InteracUng proteins by mass spectrometry. If they are analyzed 



wo 200S/0169S6 



PCT/CA2004/001448 



-63- 



by other methods, e.g. immunoblotting, the background can be ignored. Also, 
the background from the contaminant proteins is negligible when compared to 
the background from the affinity tagged protein (or other subunits of the stable 
protein complex) that Is encountered when a classical purification via affinity 
5 tag Is performed. 

The following proteins were identified by performing the method of the 
invention with several affinity tagged subunits of RNA Polymerase II from 
Saccharomyces cerevisiae. Proteins separated by a slash are subunits of 
stable protein complexes. 

10 (a) transcription factors that are known to bind to RNA Polymerase II 

and proteins associated with them: Tfg1/Tfg2/Tfg3 - TFIIF, SptS, 
TFIIS, Spt6/lws1, Fcp1. Cet1/Ceg1, TFIIB. Rgr1/Srb4/Med1/Med4 

- mediator subunits, Ppnl - interact with TFIIB, Nip1 - interact 
with TFIIB, Rtf1/Paf1/Ctr9/Leo1/Cdc73 - PAF complex, 
Spt16/Pob3, Taf7/Taf12 - TFliD, Ess1 , Rad3/Tfb2 - TFIIH, Tfa2 
(TRIE). 

(b) Proteins that are known to be involved in mRNA transcription or 
mRNA processing but are not known to associate directly or 
indirectly with the core enzyme: Npl3 -DNA binding; involved in 

^° mRNA transport, Hrp1 - mRNA cleavage/polyadenylation factor, 

Prp43 - mRNA splicing factor, Syfl - mRNA splicing factor, Smb1 

- mRNA splicing factor, Puf2 - mRNA binding protein, Gcd14 - 
RNA processing/modification factor, Fipl - mRNA polyadenylation 
factor, CIpl - cleavage-polyadenylation factor lA subunit, Ylr41 9 - 

2® f^NA helicase; involved in RNA splicing, Rgm1 - transcription 

factor, Rgt1 - transcriptional activator, Cdc39 - transcription 
factor, Ygl244 - transcription cofactor activity, Mbfl - bridges Gcn4 
and Spt15, Stb5 - transcriptional activator, 

(c) Proteins that interact with DNA/chromatin and/or mRNA and/or 
30 NTPs: Rap1- Interacts with Rif1. DNA-binding protein, Rif2 - 

interacts with Rap1 and with Rif1, Ddr48 - DNA damage inducible 
protein, Ddcl - involved in DNA damage checkpoint, Prp22 - 
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helicase-like protein, Prp18 - helicase-like protein, Cdc6 - 
GTPase/ATPase; cell cycle control, Ygr054 - part of Swi/Snf and 
Npl3 complexes, Sdc26 - GDP/GTP exchange factor for Ras, 
Glo3 - GTPase activator, 
5 (d) other proteins: Sap4 - associates with the SIT4 phosphatase, 

Sap185 - SIT4 associated protein, Ptc2 - protein phosphatase 
type 2C, Ubp8 - putative deubiquitinating enzyme, Kin4 - nuclear 
protein kinase. Hbtl - Hub1 target, Yhr121w - Stb5 associated, 
Sup35 - interacts with Mip6 and NabS, Sip1 - Snf1 protein kinase 
"•0 substrate, Rtt103 - binds to CTD, Mks1, Ydl145 (Cop1), Mpd2 - 

protein disulfide isomerase. More proteins are indicated in the 
Examples. 

The nucleotide sequences of the genes and amino acid sequences of 
the proteins can be found by submitting the ORF name or the name of the 

15 protein at: Saccharomyces Genome Database at yeastgenome.org/ or, at 
National Center for Biotechnology Information web site at 
ncbi.nlm.nih.gov:80/mapview/map_search.cgi?taxid=4932. 

Note that by using only one method, i.e. the method of the invention, it 
is possible to identify directly nearly all proteins that are known to interact with 

20 RNA Polymerase II and that were identified during the last 20 years by using 
various methods (including genetic based and library based methods). 

Note that among the detected proteins are two well-known enzymes 
that modify covalently the carboxyl-terminal domain (CTD) of RNA 
Polymerase il: (a) Fcp1 - a TFIIF interacting phosphatase that recycles RNA 

25 Polymerase II, (b) Ess1 - a prolyl isomerase of the CTD. Since post- 
translational modifications play a major role in modulating the protein function, 
the identification of the modifying enzyme for a protein of Interest is an 
important application of the invention. In addition, the Invention can be used to 
identify the substrate for an enzyme of interest. 

30 The presence of many (more than 30) well-established transcription 

factors in the high salt eluates obtained from immobilized RNA Polymerase II 
is enough for validation of the invention, i.e. the proteins isolated by the 
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method of the Invention are true Interacting proteins. One way to validate 
Interacting proteins is to perform the method of the Invention with a fusion 
protein containing the putative Interacting protein. The presence of the first 
protein of Interest among the Interacting proteins increases the confidence 
5 that the protein-protein Interaction Is physiologically relevant. For example, by 
performing the method of the Invention with tagged subunits of RNA 
Polymerase II, the three subunits of the transcription factor TFIIF (Tfg1, Tfg2 
and Tfg3) are detected in the high salt eluate. and. on the other hand, by 
perfomiing the method of the Invention with tagged Tfg1, the subunits of the 

10 RNA Polymerase II core enzyme are detected (Figures 7 and 8). 

Alternatively, the validation can be performed by other methods for 
detection of protein Interactions as described by Phizicky E. and Fields S. In 
Microbiological Reviews, Mar. 1995, Vol. 95, No. 1. or as described by 
Phizicky E. Bastiaens PI, Zhu H, Snyder M, Fields S., Nature 2003. For 

15 example, purification via an affinity tag fused to the putative Interactor can be 
performed. RttlOS Is among the interacting proteins detected by the method 
of the invention and, In order to validate It as an interactor, a tandem affinity 
purification was perfonned with a strain expressing TAP-tagged RttlOS. As 
shown In Figure 22. two subunits of the RNA Polymerase II core complex 

20 (Rpb1 and Rpb2) co-purified with RttlOS. Legend: 470 - Rpbl, 471 - Rpb2, 
472 - Rati , 47S - Rtt1 03, 474 - Rail . 
Example 5: Drug Screeni ng and Drug DiscoverY 

The Invention can be used for monitoring the effect of a pre-drug on a 
protein-protein interaction that Is associated with a disease. The protein can 

25 be fused to a SpA-tag but If the cellular lysate or other biological fluid contains 
a high level of Immunoglobulins or other proteins that bind to SpA-tag or IgG, 
a different affinity tag should be used (e.g. MBP or SBP or GST). In this 
example, the protein does not contain a pathogenic mutation. It Is best If a 
complete proteomic map has been built around the protein of Interest before 

30 the experiment, so that the presence of Interacting proteins in the eluate can 
be determined by Immunoassays or other sensitive methods. 
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Cellular lysate or a fraction of cellular lysate or other biological fluid 
from human cells containing a modified gene that encodes a fusion protein 
containing the SpA-tag is mixed with IgG-beads. After removing the unbound 
substances, the Immobilized complex is treated with washing buffer 
5 containing the pre-drug in concentration between 0 and ImM. The 
concentration of the pre-drug can vary. The effect of the pre-drug is 
determined by monitoring the presence of interacting proteins in the eluate. 
The following protocol can be used (all manipulations are performed at 0**C - 
5°C): 

10 1. The cells are collected and frozen in liquid nitrogen for 1 minute. After 

melting and adding lysis buffer, the lysate is homogenized by 10-20 
strokes. Cellular lysate from 5x10® mammalian cells in 10ml Lysis 
Buffer (50mM KCI, 10mM HEPES pH 7.4, 10% glycerol, 1% DMSO, 
0.1% TrItonXlOO, 0.2mM PMSF, 0.2mM benzamidine.) is centrifuged 

15 at 16,000 rpm for 20 minutes and the supernatant is transferred to a 

clean tube. 

2. Adding 100 microliters of IgG beads and rotating the tube for 1.5 hours. 

3. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

20 4. Transferring the beads to a 10 ml chromatography column. Washing 

the column with 10ml washing buffer (WB) by applying 200 microliter 
aliquots (WB - 60ml\/l KCI. lOmM HEPES pH 7.4, 5% glycerol, 0.2% 
DMSO. 0.1% TrItonXlOO, 0.2mM PMSF, 0.2mM benzamidine). 

5. Transfenring the beads to a 2ml tube and adding 0.5 ml WB containing 
25 1 mM pre-drug. 

6. Rotating the tube for 30 minutes. 

7. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 
with two column volumes of WB containing ImM pre-drug and 

30 combining the eluates. 
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8. Analyzing the eluate for the presence or absence of Interacting proteins 
or other biomolecules in order to detemriine the effect of the pre-drug 
on the complex. 

The above method can be modified in several ways: 
5 (I) The fusion protein contains a pathogenic mutation and the pre-drug is 
present in all solutions: growing medium, lysis buffer and washing buffer. In 
steps 5. 6 and 7, the elution is performed by WB without a pre-drug in order to 
check whether the absence of the pre-drug leads to dissociation of the 
complex. 

10 (II) Another variant is to perform the treatment in steps 5, 6 and 7 with WB 
containing 0.3M KCI and a pre-drug. Presumably, the concentration of the 
drug should be lower when it is applied to living cells (i.e. in vivo) and higher 
when it is applied to the buffers (i.e. in vitro). 

If the pre-drug affects the complex at 1 mM concentration, the 

15 experiment can be repeated with lower concentrations of the pre-drug. e.g. 
lOOmcM or 10 mcM or ImcM. in order to determine the lowest concentration 
that affects the complex. After conducting several experiments, the pre-drug 
can be modified and experiments can be repeated in order to determine 
whether the modification increases the specificity of the pre-drug. i.e. whether 

20 the pre-drug affects the complex at lower concentrations. 

In another example, before conducting the drug treatment experiments, 
a proteomic map can be built around the protein of interest so that it is known 
which protein-protein Interaction is associated with a particular disease. In this 
case, both proteins can be tagged with different tags and the interacting 

25 protein can be detected by immunoassays or other assays in order to reduce 
the scale of the experiment. Columns containing as low as 10 mcl or 20 mcl 
affinity matrix can be used (Formosa T., Methods Enzymol. 1991). 
Alternatively, if both proteins are present in a very low copy number in the cell. " 
they can be over-expressed as fusion proteins In an exogenous or 

30 endogenous organism and the experiment can be performed with purified 
proteins. In a further example, the proteins can be over-expressed in their 
endogenous organism and the regular method of the invention can be earned 
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out. Again, the over-expression sliould not lead to formation of inclusion 
bodies. 

Two types of control experiments can be performed in order to validate 
the effect of the pre-drug: 

5 (I) step 5, 6 and 7 are performed without a pre-drug; or 

(II) testing the effect of a pre-drug on one or more biologically unrelated 
protein complexes. 

Figure 19 illustrates the selective effect of a drug on a particular 
protein-protein interaction. 

10 Legend: 430 - left panel: the Immobilized protein of interest is treated 

with nonspecific elution agent and all interacting proteins are separated, 431 - 
middle panel: the immobilized protein of interest is treated with specific elution 
agent which causes separation of only one interacting protein complex, 432 - 
right panel: the unrelated protein complex is treated with the same specific 

15 elution agent but there is no separation of the interacting proteins, 438 - 
unrelated protein, 439 - protein interacting with the unrelated protein, 449 ~ 
another protein Interacting with the unrelated protein, 440 - elution with 
nonselective elution agent (e.g. increased salt concentration), 441 - elution 
with selective agent (e.g. drug treatment), 18 - eluate obtained by the method 

20 of the invention, i.e. proteins eluted by treatment with a nonselective elution 
agent, 445 - proteins eluted by treatment with a selective agent; 446 - the 
unrelated protein complex is not affected by the treatment with a selective 
agent and the interacting proteins are not eluted. 

25 Example 6: Benefits a nd Features of the Method of the Invention. 

Figure 13 shows a comparison between the method of the invention 
and other methods for detecting protein-protein interactions that utilize affinity 
tagged proteins. 

Legend: 301 - the left panel illustrates GST pull down, 302 - the middle 
30 panel illustrates purification via an affinity tag, 303 - the right panel illustrates 
the method of the Invention. 310 - exogenous organism (e.g. Escherichia 
coli), 311 - endogenous organism, 312 - affinity tagged protein, 313 - the 
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affinity tagged protein and the interacting proteins are Immobilized on solid 
phase, 314 - unbound proteins and/or other substances (solid black) are 
removed, 315 - Isolating the interacting proteins - only in the case of the 
invention they are separated from the affinity tagged protein (the eluate does 
5 not contain affinity tagged protein), 316 - eluate obtained by GST pull down; 
317 - eluate obtained by purification via an affinity tag; 18 - eluate obtained by 
the method of the invention. 

GST pull down - a single fusion protein is expressed in an exogenous 
organism (most often E. coll) and is used to prepare the affinity column. GST 
10 pull down can be perfonmed by separating the fusion protein from the affinity 
matrix or by separating the interacting proteins from the immobilized protein of 
interest. 

Purification via an affinity tag - the fusion protein and the interacting 
proteins are isolated together. Only in rare cases can transient complexes be 
1 5 detected. 

The Invention - the substoichiometrlcally Interacting proteins are 
separated from the Immobilized fusion protein and the dynamic range problem 
and/or other problems are eliminated. 

Figure 14 illustrates the necessity to separate substoichiometrlcally 
20 interacting proteins from the high abundance fusion protein. The bulk of the 
fusion protein is immobilized via an affinity tag whereas only the fraction of 
interacting proteins that has been bound to the fusion protein in vivo is 
Immobilized. 

Legend: 330 - transient complexes are immobilized on an affinity 
25 matrix via an affinity tag, 331 - purification via an affinity tag, 332 - 
purification by the method of the invention, 317 - eluate obtained by 
purification via an affinity tag, 1 8 - eluate obtained by the method of the 
invention. 

Note the amount of the tagged protein that Is not associated with 
30 Interacting proteins. In the Illustration the ratio between the fusion protein and 
each Interacting protein is 4:1 but in reality, the ratio can vary between 10:1 
and more than 100:1. Moreover, when the fusion protein Is a subunit of a 
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stable protein complex, the problems arising from the different stoichiometries 
are even more complicated. 

Figure 16 illustrates the formation and disintegration of a transient 
protein complex between two stable (pemianent) protein complexes (first and 
5 second vertical arrows respectively). 

Legend: 351 - stable protein complex with 4 subunits, 352 - stable 
protein complex with 6 subunits, 353 - transient protein complex between the 
two stable complexes, 354 - subunit of a stable complex that binds indirectly 
to another stable complex, 355 - fomiation of a transient protein complex, 356 
10 - disintegration of the complex. 

Figure 16. A model for transient protein-protein interactions according 
to which fomiation and disintegration of transient protein complexes is due to 
alternation of electrostatic attraction and electrostatic repulsion between the 
electrostatically charged amino acids. 

15 Legend: 370 - first interacting protein, 371 - second interacting protein, 

372 - positively charged amino acids, 373 - negatively charged amino acids, 
374 - electrostatic attraction, 375 - two interacting proteins form a complex, 
376 - change of position or orientation of the two interacting proteins relative 
to one another, 377 - electrostatic repulsion, 378 - transient complex 

20 disintegrates. 

Figure 17. When a mutation changes the electrostatic properties of the 
protein, the transient interaction cannot occur property. 

Legend for the upper panel: 380 - mutation leading to disappearance 
of positively charged amino acid, 381 - electrostatic attraction is prevented. 
25 Legend for the lower panel: 382 - mutation leading to disappearance of 
different positively charged amino acid, 383 - electrostatic repulsion Is 
prevented and the interacting proteins remain stuck together. 

Figure 18. Method for drug design and schematic structure of a 
chemical or a biomolecule that binds to a mutant protein and restores (at least 
30 partially) its electrostatic properties. As a result, the protein is capable of 
forming a protein complex. 
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Legend: 390 - drug treatment, 391 - a drug, 392 - an exploded view of 
the drug, 393 - part that mimics and/or restores the original charge, 394 - 
linker (optional), 395 - part that binds selectively to the mutant protein. 

When the disease is caused by a mutation that converts a permanent 
5 protein-protein association into a weak interaction, the drug should stabilize 
the protein-protein association. Drugs that are designed or discovered 
according to the invention, can be defined as chemicals that restore the 
equilibrium binding constant of a mutant protein complex back to that of the 
respective native one. 
10 If a human disease has an analog in another organism (e.g. mouse or 

rat), a heterologous nucleic acid can be administered into the organism so 
that it expresses an affinity tagged protein of interest. In this case, the method 
of the invention can be performed with cellular lysate or other biological fluids 
from the sacrificed organism. 

15 

Example 7: Variations of the Method with respect to the Affinitv Tag 

The following examples illustrate that the methods of the invention can 
be carried out with different affinity tags. The scale of the experiments is 
suitable for detection by mass spectrometry of proteins interacting with a 
20 protein of interest that is present in approximately 5,000 - 10,000 copies per 
cell or more. 

If a heterologous nucleic acid is introduced into a Sacchanomyces 
cerevisiae strain such that the organism can express a fusion protein 
containing a GST-tag, then the following protocol can be used: 
25 1 . Growing 12 liters Saccharomyces cerevisiae culture in YPD medium to 

OD600 = 0.8. 

2. Collecting the cells by centrifugation at 4,000 rpm for 4 minutes and 
decanting the medium. All subsequent manipulations are performed on 
ice or at 0°C to 5°C. 

30 3. Suspending the cells in 100 ml water and distributing into 50ml plastic 

tubes; centrifugation at 4.000 rpm for 4 minutes and decanting the 
water. 
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4. Freezing in liquid nitrogen for 1 minute. 

5. Breaking the solid pellet and adding protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry Ice; grinding with a coffee grinder for 1 minute; 
transferring the powder to a beaker and, after allowing it to melt, adding 

5 the same volume of 2x Lysis buffer. Stirring with a plastic rod for 2 

minutes. 2x Lysis Buffer: lOOmM KCI, 50mM HEPES pH 7.4, 20% 
glycerol. 2% DMSO, 0.2% TritonXlOO, 2mM PMSF. 2mM benzamidlne. 

6. Centrifugatlon at 16,000 rpm for 30 minutes and transferring the 
supernatant to a clean tube. 

10 7. Adding 500 microliters of glutathione beads and rotating the tube for 2 

hours. 

8. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

9. Transferring and distributing the glutathione beads to two parallel 10ml 
15 chromatography columns. Washing each column with 10ml washing 

buffer (WB) by applying 200 microliter aliquots. WB: 50mM KCI, lOmM 
HEPES pH 7.4, 5% glycerol. 1 % DMSO, 0. 1 % TritonXI 00. 

10. Transferring the beads to a 5ml tube and adding 4ml WB with 0.4M 
KCI (WB(0.4M)). 

20 11. Rotating the tube for 1 0 minutes. 

12. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eiuate by gravity flow. Washing the beads 
with two column volumes of WB(0.4M) and combining the eluates. 

13. Analyzing the interacting proteins and/or other blomolecules in the 
25 eiuate. Optionally, proceed to 14. 

14. Distributing the eiuate to eppendorf tubes and adding 1/5 volume of 
100% TCA. 

16. Incubating on ice for 30 minutes; centrifugation for 30 minutes - 14,000 
Rpm at O^C to 5''C. 

30 16. Decanting the supernatant and adding 1 ml 90% acetone. Vortexing for 

30 seconds and centrifuging for 10 minutes at 14,000 rpm. Decanting 
the supernatant. 
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17. Drying with SpeedVac for 15 seconds. 

18. Protein gel electrophoresis (SDS-PAGE - 10% gel, 20 cm long) and 
silver staining. 

\Nhen the interacting proteins are isolated in low amounts and an 
5 additional affinity purification step is needed before the separation from the 
immobilized fusion protein, a variety of combinations of two affinity tags can 
be used. For example, the combination can be a GST-tag and a SpA-tag. The 
following protocol can be used In several cases: 

(a) One subunit of protein complex is fused to a GST-tag and another 
10 subunit is fused to a SpA-tag, 

(b) The protein or subunit of the protein complex contains a GST-tag at the 
N-terminus and a SpA-tag at the C-terminus, 

(c) Both tags can be located at the same end. 

1. Growing 18 liters Saccharomyces cerevisiae culture in YPD medium to 
15 OD600 = 0.8. 

2. Collecting the cells by centrifugation at 4,000 rpm for 4 minutes and 
decanting the medium. All subsequent manipulations are performed on 
ice or at 0°C to 5°C. 

3. Suspending the cells in 200 ml water and distributing into 50ml plastic 
20 tubes; centrifugation at 4,000 rpm for 4 minutes and decanting the 

water. 

4. Freezing in liquid nitrogen for 1 minute. 

5. Breaking the solid pellet and adding protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry ice; grinding with a coffee grinder for 1 minute; 

25 transferring the powder to a beaker and, after allowing it to melt, adding 

the same volume of 2x Lysis buffer. Stirring with a plastic rod for 2 
minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES pH 7.4, 20% 
glycerol, 2% DMSO, 0.2% TritonXlOO, 2mM PMSF. 2mM benzamidine. 

6. Centrifugation at 16,000 rpm for 30 minutes and transferring the 
30 supernatant to a clean tube. 

7. Adding 800 microliters of glutathione beads and rotating the tube for 2 
hours. 
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8. Centrifugatlon at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

9. Transferring and distributing the glutathione beads to four parallel 10ml 
chromatography columns. Washing each column with 10ml washing 

5 buffer (WB) by applying 200 microliter aliquots. WB: lOOmM KCI, 

lOmM HEPES pH 7.4, 5% glycerol, 1% DMSO. 0.1% TritonXIOO. 

10. Transferring the beads to a 5ml tube and adding 3ml WB containing 10 
mM reduced glutathione. 

1 1 . Rotating the tube for 1 0 minutes. 

10 12. Transferring the beads and the buffer to a 10ml chromatography 

column and collecting the eluate by gravity flow. Washing the beads 
with two column volumes of WB containing 10 mM reduced glutathione 
and combining the eluates in a 5 ml tube. 

13. Adding 600 microliters of IgG beads to the eluate and rotating the tube 
15 for 1 hour. 

14. Centrifugatlon at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

15. Transferring and distributing the IgG beads to two parallel 10ml 
chromatography columns. Washing each column with 10ml WB by 

20 applying 200 microliter aliquots. 

16. Transferring the beads to a 5ml tube and adding 3ml WB containing 
0.5M KCI (WB(0.5M)). Rotating the tube for 10 minutes. 

17. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 

25 with two column volumes of WB(0.5M) and combining the eluates. 

18. Analyzing the eluate. 

Another example is a combination of a SpG-tag and a SBP (Streptavidin 
binding peptide). The following protocol can be used: 

1 . Growing 12 liters Saccharomyces cerevisiae culture In YPD medium to 
30 OD600 = 0.9. 
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2. Collecting the cells by centrifugation at 4,000 rpm for 4 minutes and 
decanting the medium. All subsequent manipulations are performed on 
ice or at 0°C to 5°C. 

3. Suspending the cells in 200 ml water and distributing into 50ml plastic 
tubes, centrifugation at 4,000 rpm for 4 minutes and decanting the 
water. 

4. Freezing in liquid nitrogen for 1 minute. 

5. Breaking the solid pellet and adding protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry ice; grinding with a coffee grinder for 1 minute; 
transferring the powder to a beaker and, after allowing it to melt, adding 
the same volume 2x Lysis buffer. Stirring with a plastic rod for 2 
minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES pH 7.4, 20% 
glycerol, 2% DMSO, 0.2% TritonXlOO, 2mM PMSF, 2mM benzamidine. 

6. Centrifugation at 16,000 rpm for 20 minutes and transferring the 
supernatant to a clean tube. 

7. Adding 600 microliters streptavidin beads and rotating the tube for 1.5 
hours. 

8. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

9. Transferring and distributing the streptavidin beads to three parallel 
10ml chromatography columns. Washing each column with 10ml 
washing buffer (WB) by applying 200 microliter aliquots. WB: lOOmM 
KCI. 10mM HEPES pH 7.4, 5% glycerol, 1% DMSO, 0.1% TritonXlOO. 

10. Transferring the beads to a 5ml tube and adding 4ml WB containing 
4mM biotin. 

1 1 . Rotating the tube for 1 0 minutes. 

12. Transferring the beads and the buffer to 10ml chromatography column 
and collecting the eluate by gravity flow. Washing the beads with two 
column volumes of WB containing 4mM biotin and combining the 
eluates in a 5 ml tube. 

13. Adding 600 microliters of IgG beads to the eluate and rotating the tube 
for 1 hour. 
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14. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

15. Transferring and distributing the IgG beads to three parallel 10ml 
chromatography columns. Washing each column with 10ml WB by 

5 applying 200 microliter aliquots. 

16. Transferring the beads to a 5ml tube and adding 3.5 mi WB containing 
0.5M KCI (WB(0.5M)). Rotating the tube for 10 minutes. 

1 7. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 

10 with two column volumes of WB(0.5M) and combining the eluates. 

18. Analyzing the eluate. 

Another example is a combination of a IVIBP and a HA-tag. The 
following protocol can be used: 

1. Growing 12 liters Saccharomyces cerevisiae culture in YPD medium to 
15 OD600 = 0.9. 

2. Collecting the cells by centrifugation at 4,000 rpm for 4 minutes and 
decanting the medium. All subsequent manipulations are performed on 
Ice or at O^'C to 5°C. 

3. Suspending the cells in 200 ml water and distributing into 50ml plastic 
20 tubes, centrifugation at 4,000 rpm for 4 minutes and decanting the 

water. 

4. Freezing in liquid nitrogen for 1 minute. 

5. Breaking the solid pellet and adding protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry ice; grinding with a coffee grinder for 1 minute; 

25 transferring the powder to a beaker and, after allowing it to melt, adding 

the same volume 2x Lysis buffer. Stirring with a plastic rod for 2 
minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES pH 7.4, 20% 
glycerol, 2% DMSO, 0.2% TritonXlOO, 2mM PMSF, 2mM benzamidlne. 

6. Centrifugation at 16,000 rpm for 20 minutes and transferring the 
30 supernatant to a clean tube. 

7. Adding 800 microliters of anti-HA affinity matrix and rotating the tube 
for 2 hours. 
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8. Centrifugation at 3.000 rpm for 2 minutes and decanting the 
supernatant. 

9. Transferring and distributing the antl-HA affinity matrix to four parallel 
10ml chromatography columns. Washing each column with 10ml 

5 washing buffer (WB) by applying 200 microliter aliquots. WB: lOOmM 

KCI, 10mM HEPES pH 7.4, 5% glycerol, 1% DMSO, 0.1% TritonXIOO. 

10. Transferring the beads to a 5 ml tube and adding 3 ml WB containing 
HA-tag at 1 mg/ml. 

1 1 . Rotating the tube for 10 minutes. 

10 12. Transferring the beads and the buffer to 10ml chromatography column 

and collecting the eluate by gravity flow. Washing the beads with two 
column volumes of WB containing HA peptide at 1 mg/ml and 
combining the eluates in a 10 ml tube. 

13. Adding 600 microliters of cross-linked amylose beads to the eluate and 
15 rotating the tube for 1 hour. 

14. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

15. Transferring and distributing the beads to three parallel 10ml 
chromatography columns. Washing each column with 10ml WB by 

20 applying 200 microliter aliquots. 

16. Transferring the beads to a 5ml tube and adding 3.6 ml WB containing 
0.5M KCI (WB(0.5M)). Rotating the tube for 10 minutes. 

17. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 

25 with two column volumes of WB(0.5M) and combining the eluates. 

18. Analyzing the eluate. 

Another example is a combination of a SBP (Streptavidin binding 
peptide) and a poly-His tag. 
The following protocol can be used: 

30 1. Growing 18 liters Sacxhammyces cerevisiae culture In YPD medium to 

OD600 = 0.7. 
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2. Collecting the cells by centrifugation at 4,000 rpm for 4 minutes and 
decanting the medium. All subsequent manipulations are performed on 
ice or at O^C to S^'C. 

3. Suspending the cells in 200 ml water and distributing into 50ml plastic 
5 tubes; centrifugation at 4,000 rpm for 4 minutes and decanting the 

water. 

4. Freezing in liquid nitrogen for 1 minute, 

5. Breaking the solid pellet and adding protease inhibitors and 2-3 pieces 
(around 8 cm^) of dry ice; grinding with a coffee grinder for 1 minute; 

10 transfening the powder to a beaker and, after allowing it to melt, adding 

the same volume of 2x Lysis buffer. Stirring with a plastic rod for 2 
minutes. 2x Lysis Buffer: 200mM KCI, 50mM HEPES pH 7.4, 20% 
glycerol, 2% DMSO. 0.2% TritonXlOO, 2mM PMSF, 2mM benzamidine. 

6. Centrifugation at 16,000 rpm for 20 minutes and transferring the 
15 supernatant to a clean tube. 

7. Adding 800 microliters streptavidin beads and rotating the tube for 1.5 
hours. 

8. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

20 9. Transferring and distributing the streptavidin beads to four parallel 10ml 

chromatography columns. Washing each column with 10ml washing 
buffer (WB) by applying 200 microliter aliquots. WB: lOOmM KCI, 
lOmM HEPES pH 7.4, 5% glycerol, 1% DMSO, 0.1% TritonXlOO. 

10. Transferring the beads to a 5ml tube and adding 4ml WB containing 
25 4mM biotin. 

1 1 . Rotating the tube for 1 0 minutes. 

12. Transferring the beads and the buffer to lOmi chromatography column 
and collecting the eluate by gravity flow. Washing the beads with two 
column volumes of WB containing 4mM biotin and combining the 

30 eluates in a 1 0 ml tube. 

13. Adding 600 microliters of Ni2+-NTA resin to the eluate and rotating the 
tube for 1 hour. 
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U.Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

1 5. Transferring and distributing tlie Ni2+-NTA resin to three parallel 10ml 
chromatography columns. Washing each column with 10ml WB by 

5 applying 200 microliter aliquots. 

16. Transferring the Ni2+-NTA resin to a 5ml tube and adding 3.5 ml WB 
containing 0.4M KCI (WB(0.4M)). Rotating the tube for 10 minutes. 

17. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 

1 0 with two column volumes of WB(0.4M) and combining the eluates. 

18. Analyzing the eluate. 

When TAP-tag is used and the first immobilization is on calmodulin 
beads, the iysis buffer and washing buffer should contain 2mM Ca++. After 
removing the unbound substances and releasing the fusion protein by 2mM 
15 EGTA treatment, the fusion protein is immobilized on IgG beads and the 
residual unbound substances are removed. The separation of the Interacting 
proteins and/or other substances from the immobilized fusion protein is 
perfomned with WB containing 0.6 M KCI. 

Examples of carrying out the invention with a protein of interest devoid 
20 of an affinity tag: 

A specific antibody can be raised against the protein of interest and 
subsequently immobilized on a solid support. Cellular lysate or a fraction of 
cellular lysate from an organism lacking an affinity tagged protein is mixed 
with antibody coated solid support and after removing the unbound 
25 substances, blomolecules that are bound to the protein of interest are 
separated from it by increasing the salt concentration to 0.3M KCI. The 
following protocol can be used (all manipulations are performed at 0°C to 
5°C): 

1 . The cells are collected and frozen in liquid nitrogen for 1 minute. After 
30 melting and adding lysis buffer, the lysate is homogenized by 10-20 

strokes. 
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2. Cellular lysate from 1010 mammalian cells in 10ml Lysis Buffer (50mM 
KCI, 10mM HEPES pH 7.4, 10% glycerol. 1% DMSO, 0.1% 
TritonXlOO, ImM PMSF, ImM benzamidine) is centrifuged at 15,000 
rpm for 30 minutes and the supernatant is transfen-ed to a clean tube. 
5 3. Adding 200 microliters of antibody coated beads and rotating the tube 

for 1 .5 hours. 

4. Centrifugation at 2,000 rpm for 3 minutes and decanting the 
supematant. 

5. Transferring the beads to a 10ml chromatography column. Washing the 
10 column with 10ml washing buffer (WB) by applying 200 microliter 

aliquots (WB: 50mM KCI, lOmM HEPES pH 7.4, 7% glycerol, 1% 
DMSO, 0.1% TritonXlOO). 

6. Transferring the beads to a 5ml tube and adding 3ml WB containing 
0.3M KCI (WB(0.3M)). 

15 7. Rotating the tube for 1 0 minutes. 

8. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 
with two column volumes of WB(0.3M) and combining the eluates. 

9. Analyzing the eluate. 

20 The previous protocol can be modified in order to avoid the 

immobilization of the antibody. An IgG type antibody can be raised against the 
protein of interest and, after binding to the corresponding protein of interest, 
the antibody can be immobilized on Protein A - Sepharose. After removal of 
the unbound substances, biomolecules that are bound to the protein of 

25 Interest are separated from it by increasing the salt concentration to 0.35M 
KCI. The following protocol can be used (all manipulations are performed at 
0°C to 5°C): 

1 . The cells are collected and frozen In liquid nitrogen for 1 minute. After 
melting and adding lysis buffer, the lysate is homogenized by 10-20 

30 strokes. 

2. Cellular lysate from 5x109 mammalian cells in 10ml Lysis Buffer 
(50mM KCI, 10mM HEPES pH 7.4, 10% glycerol, 1% DMSO. 0.1% 
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TritonXlOO, 1mM PMSF, 1mM benzamidine. 0.5. mM MgCI2, 0.1 mM 
DTT) is centrifuged at 15,000 rpm for 20 minutes and the supernatant 
is transferred to a clean tube. 
3. Adding tlie antibody and incubating for 1 .5 hours. 
5 4. Adding 300 microliters of Protein A-sepharose beads and rotating the 

tube for 1 hour. 

5. Centrifugation at 2,000 rpm for 3 minutes and decanting the 
supernatant. 

6. Transferring the beads to a 10ml chromatography column. Washing the 
10 column with 10ml washing buffer (WB) by applying 200 microliter 

aliquots (WB - 50mM KCI, lOmM HEPES pH 7.4, 7% glycerol, 1% 
DMSO. 0.1% TritonXlOO, 0.5 mM MgCI2, 0.1 mM DTT). 

7. Transferring the beads to a 5ml tube and adding 3ml WB containing 
0.35M KCI (WB(0.35M)). 

15 8. Rotating the tube for 1 5 minutes. 

9. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 
with two column volumes of WB (0.35M) and combining the eluates. 

1 0. Analyzing the eluate. 

20 Another variation consists of using biotinylated antibodies and utilizing 

their strong binding to streptavidin. The antibody can be biotinylated and 
added to the cellular lysate and, after 1 hour incubation, the lysate can be 
mixed with streptavidin coated sepharose beads. The following protocol can 
be used (all manipulations are performed at 0°C to 5*^0): 
25 1 . The cells are collected and frozen in liquid nitrogen for 1 minute. After 

melting and adding lysis buffer, the lysate is homogenized by 10-20 
strokes. 

2. Cellular lysate from 1010 mammalian cells in 10ml Lysis Buffer (50mM 
KCI, lOmM HEPES pH 7.4, 10% glycerol, 1% DMSO, 0.1% 
30 TritonXlOO, ImM PMSF, ImM benzamidine, 0.5 mM MgCI2, 0.1 mM 

DTT) is centrifuged at 16,000 rpm for 30 minutes and the supernatant 
is transferred to a clean tube. 



wo 2005/016956 



PCT/CA2004/001448 



-82- 

3. Adding biotinylated antibody and rotating the tube for 1 .5 hours. 

4. Adding 200 microliters of streptavidin-sepharose beads and rotating 
the tube for 1 hour. 

5. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
5 supernatant. 

6. Transferring the beads to 10ml chromatography column. Washing the 
column with 10ml washing buffer (WB) by applying 200 microliter 
aliquots (WB - 50mM KCI, lOmM HEPES pH 7.4. 5% glycerol, 1% 
DMSO, 0.1% TritonXlOO). 

10 7, Transferring the beads to a 5ml tube and adding 4ml WB containing 

0.3M KCI (WB(0.3M)). 

8. Rotating the tube for 10 minutes. 

9. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 

15 with two column volumes of WB(0.3M) and combining the eluates. 

10. Analyzing the eluate. 

Proteins that interact with glycoproteins can be isolated by the method 
of the invention. Cellular lysate or a fraction of cellular lysate or other 
biological fluid from an organism or cell culture without a genetically modified 

20 gene (i.e. devoid of an afTinity tag) is mixed with lectin-coated beads in order 
to immobilize the glycoproteins. After removing the unbound substances, 
biomolecules that interact with glycoproteins are separated from them by 
increasing the salt concentration to 0.4 - 0.5 M KCI. In this case, what is 
isolated is not a group of proteins that interact with a particular glycoprotein, 

25 but proteins or other substances that interact with any glycoprotein that is 
immobilized on the lectin matrix. Since glycolipids and polysaccharides also 
bind to lectin, it is recommended to perform one purification step before the 
method of the invention in order to obtain a fraction enriched in glycoproteins. 
The following protocol can be used (all manipulations are performed at O^C to 

30 5''C): 



wo 2005/016956 



PCT/CA2004/001448 



-83- 

1 . The cells are collected and frozen in liquid nitrogen for 1 minute. After 
melting and adding lysis buffer, the lysate is homogenized by 10-20 
strokes. 

2. Cellular lysate from 1010 mammalian ceils in 10ml Lysis Buffer (50mM 
5 KCI, lOmM HEPES pH 7.4, 10% glycerol, 1% DMSO. 0.1% 

TritonXlOO, 1mM PMSF, ImM benzamidine, ImM MnCI2, ImM CaCI2) 
is centrifuged at 16,000 rpm for 30 minutes and the supernatant is 
transferred to a clean tube. (Optionally, the lysate can be fractionated 
in order to obtain a fraction that is enriched in glycoproteins or a 
1 0 particular g lycoprotein) . 

3. Adding 200 microliters of lectin coated beads and rotating the tube for 
1 hour. 

4. Centrifugation at 3,000 rpm for 2 minutes and decanting the 
supernatant. 

15 5. Transferring the lectin beads to a 10ml chromatography column. 

Washing the column with 10ml washing buffer (WB) by applying 200 
microliter aliquots (WB - 50mM KCI, lOmM HEPES pH 7.4, 5% 
glycerol, 1% DMSO, 0.1% TritonXlOO, 1mM MnCla. ImM CaCb). 

6. Transferring the beads to a 5mi tube and adding 3.5ml WB containing 
20 0.4M KCI (WB(0.4M)). 

7. Rotating the tube for 1 0 minutes. 

8. Transferring the beads and the buffer to a 10ml chromatography 
column and collecting the eluate by gravity flow. Washing the beads 
with two column volumes of WB(0.4M) and combining the eluates. 

25 9. Analyzing the eluate. 

Another way to immobilize the protein of interest is by metal chelate 
affinity (e.g. poly-His tag) and to elute the interacting proteins by increasing 
the ionic strength of the medium. 

In addition, the protein of interest can be immobilized by predominantly 
30 electrostatic forces (e.g. by Flag-tag) and the elution can be performed with 
an agent that affects the strength of hydrophobic bonds (e.g. ethylene glycol). 
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The elution agent might partially disrupt the bonding between the 
affinity tag and the affinity matrix. Figure 20 illustrates elution by a gradient 
5 and collecting different fractions and analyzing each one (by immunodetection 
for example) for the presence of the affinity tag or the affinity tagged protein: 
401 - chromatography column; 402 - nine subsequent fractions, small amount 
of each one are blotted to a membrane and analyzed; 411, 412, 413, 414, 
415, 416 - blots corresponding to fractions that do not contain affinity tagged 
10 protein, 417. 418, 419 - blots corresponding to fractions that contain 
increasing amounts of the affinity tagged protein. 

Figure 21 illustrates the problems with analyzing fractions or gel bands 
containing proteins with different abundances by mass spectrometry. After the 
trypsin digest, the concentration range of the peptides remains approximately 
15 the same and it results in mass spectrum containing peaks with different 
intensity. 

Left panel: When the mass spectrum contains large and small peaks 
separated by only a few (1-5) mass units, the detection of the latter is 
impossible because of the dynamic range problem in mass spectrometry. 
20 Analyzing such spectrum leads to the identification of only the major protein(s) 
in the band. 

Right panel: High background noise problem in mass spectrometry - 
when a protein mixture with high dynamic range (ratio between the most 
abundant and the least abundant protein) is analyzed directly the small 
25 peptides are not detected because their intensities are lower than the 
background. 

Legend: 450 - high abundance peptide; 451 - different low abundance 
peptides; 452 - background of the mass spectrum; 453 - low abundance 
peptide is not detected because it is obscured by a high abundance peptide 
30 which mass is two mass units smaller; 454 - the separation of the 
substoichiometrically interacting proteins from the high abundance protein of 
interest results in lower background (due to the absence of the high 
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abundance peptides) and makes possible the detection of the low abundance 
peptides. 

While the present invention has been described with reference to what 
are presently considered to be the preferred examples, it is to be understood 
6 that the invention is not limited to the disclosed examples. To the contrary, the 
invention is intended to cover various modifications and equivalent 
arrangements included within the spirit and scope of the appended claims. 

Ail publications, patents and patent applications are herein 
incorporated by reference in their entirety to the same extent as if each 
10 individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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Table 1 : List of Diseases 

List of diseases caused by a point mutations that lead to disappearance or 
appearance of charged amino acids (some exceptions are included). Some of 
the mutations cpuld be pathogenic by affecting the protein folding and/or 
5 changing the protein structure. However, some might make impossible the 
formation or disintegration of transient protein complexes, and/or lead to 
formation of unnatural permanent protein complexes (e.g. Sickle Cell 
Anemia). 



10 



Mutation (Substitution) 


Disease 


E6V 


Sickle Cell Anemia 

(formation of unnatural protein complexes) 

The first illness understood at the molecular level. 


T226R 


Mutation in the human voltage-gated potassium 
channel gene (Kvl. 1) associated with: 
1 -episodic ataxia type 1 
2'-partia[ epilepsv 


D79N 


Epilepsy 


G->R 


ACH - achondroplasia 

All persons with ACH exhibit the mutation. 


R245X 


Usher syndrome 
Type1 gene. PCDH15 


E200K 


Creutzfeldt-Jakob Disease (CJD) 


G12R 
G12D 


Pancreatic Adenocarcinoma - 

Mutational in the K-ras oncogene at codon 12 has 

been demonstrated in 71 to 100% of the cases. 


R506Q 


Factor V Leiden of inherited thromboDhilia 
(hypercoagulable state is caused by Activated Protein 
C Resistance) 


R79C 
R79H 
R239C 
M73R R79G 
R88C Y242D 
E373XR416W 


Alexander's Disease GFAP gene 


H63D 


Acute iron poisoning 


H63D 
C282Y 


Hereditary hemochromatosis (HH) 


R->G 


Aortic Aneurysm 


R170G 


Tay-Sachs 

(Around 30 mutations described) 


G145R 


Mutated strain of HBV -vaccine escape mutant of 
hepatitis virus 
Mutation in the S gene 


R519C 
R75C 


Mild Spondyloepiphyseal dysplasia (SED) and 
precocious generalized osteoarthritis (OA) 


E134K 


Spinal Muscular Atrophy (SMA) - Mutation in the 
central Tudor-domain of the SMN protein 
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RVARW 
r\ f oov V 

in the XPF gene 


An - xeroaerma pigmentosum 
complementation group F (XP-F) 


K304E 


Medium chain acyl CoA dehydrogenase (MCAD) 
ueiiciency 


R519C 
collagen II 


Heterogeneous cartilage disorders including some 
chondrodysplasias and certain forms of heritable 
osieoarinniis 


D816T 
D816V 
D820G 


Systemic mast cell disease (SMCD) 


W184R 


Neonatal Type 2 Gaucher Disease 




U2938T polymorphism in Cytochrome P450 CYP2D6 
- responsible for the metabolism of approximately 20 - 
25% of prescription medicines. 


Q->W 

COL9A2 gene 
R->W 

COL9A3 gene 


MED - multiple epiphyseal dysplasia, (one type of 

Chondrodysplasia) 

Collagen IX mutations 


G1489E 

G286E 

G661R 

G1006E 

G589E 

G400E 

G1003D 

G934E 

G499D 

V793G 

G637S 

G910V 

G595C 


EHLERS-DANLOS SYNDROME (EDS) 

Mutations in the triple-helical domain of type III 
procollagen (COL3A1) 


11(31 ue 

(the first transmembrane 
domain of this seven- 
transmembrane domain 
G-protein-coupled 
receptor protein) 


civero lemai wnite syndrome (ULVvo) is an inherited 
syndrome of foals bom to American Paint Horse 
parents of the overo coat-pattern lineage. Affected 
foals are totally or almost totally white and die within 
days from complications due to intestinal 
aganglionosis. Related conditions occur in humans and 
rodents in which mutations in the endothelin receptor B 
(EDNRB) gene are responsible. 
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Table 2: Alzheimer mutations 



lUliitsitinn 

IwlU UIUUI i 






E693G 


'Arctic' APP mutation 


enhanced Ap protofibril formation 


E693Q 


Dutch mutation 


intracerebral hemorrhages 


A692G 


Flemish mutation 


Intracerebral hemorrhage. And 
individuals who survive develop a 
progressive dementia with features of 
AD 


E693K 


Italian mutation 


Clinical manifestations similar to 
Dutch patients 


D694N 


Iowa mutation 


Progressive dementia and severe 
cerebral anriyloidangiopathy 
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